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TITT ,F nT? IHE TNVFNTIQN 

GERMLINE MUTATIONS IN THE MTS GENE AND METHOD FOR DETECTING 
PREDISPOSITION-TO CANCER AT THE MTS GENE 

fpnss RFFFRF Nr^ IJQ EELAIED APPT ICAHOMS 

The present invention is a continuation-m-part of appUcation Serial Nos. 08/231,938, filed .1 
June 1994, 08/215,087, filed 18 Mak* 1994, and 08/215.086, filed 18 March 1994, which are all 
incorporated herein by reference. AppUcation Serial No. 087251,938 in turn is a continuation-in- 
part 0 f application Serial No. 08/227369, filed 14 April 1994, which is a continuation-in-part of 
appUcation Serial No. 08/214,582, filed 18 March 1994, which are all incorporated herein by 
reference. 

P AnffiBffl TNQ QV THF TNVFNTTON 

The present invention relates to somatic mutations in the Multiple Tumor Suppressor (MTS) 
gene in human cancers and their use in the diagnosis and prognosis of human cancer. The 
invention further relates to gennline mutations in the MTS gene and their use in the diagnosis of 
predisposition to cancer, such as melanoma, ocular melanoma, leukemia, astrocytoma, 
glioblastoma, lymphoma, glioma, Hodgkin's lymphoma, multiple myeloma, sarcoma, 
myosarcoma, cholangiocaidnoma, squamous cell carcinoma, CLL, and cancers of the pancreas, 
breast, brain, prostate, bladder, thyroid, ovary, uterus, testis, kidney, stomach, colon and rectum. 
The invention also relates to the therapy of human cancers which have a mutation in the MTS 

gene, in cludin g gene therapy, protein replacement therapy and protein mimetics. Finally, the 

invention relates to the screening of drugs for cancer therapy. 

The publications and other materials used herein to illuminate the background of the 

invention, and in particular, cases to provide additional details respecting the practice, are 

incorporated herein by reference, and for convenience are referenced in the following text and 

respectively grouped in the appended List of References. 

The genetics of cancer is complicated, involving multiple dominant, positive regulators of 

the transformed state (oncogenes) as weU as multiple recessive, negative regulators (tumor 
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suppressor genes). Over one hundred oncogenes have been characterized. Fewer than a dozen 
tumor suppressor genes have been identified, but the number is expected to increase beyond fifty 
(Knudson, 1993). 

The involvement of so many genes underscores the complexity of the growth control 
mechanisms that operate in cells to maintain the integrity of normal tissue. This complexity is 
manifested in another way. So far, no single gene has been shown to participate m the 
development of all, or even the majority of human cancers. The most common oncogenic 
mutations art in the H-ras gene, found in 10-15% of all solid tumors (Anderson et aL, 1992). The 
most frequently mutated tumor suppressor gene is the pS3 gene, mutated in roughly 50% of all 
tumors, without a target that is common to all transformed cells, the dream of a "magic bullet" 
that can destroy or revert cancer cells while leaving normal tissue unharmed is improbable. The 
hope for a new generation of specifically targeted antitumor drugs may rest on the ability to 
identify tumor suppressor genes or oncogenes that play general roles in control of cell division. 

The tumor suppressor genes, which have been cloned and characterized, influence suscepti- 
bility to: 1) retinoblastoma (RBI); 2) Wilms* tumor (WT1); 3) Li-Fraumeni fTP53); 4) Familial 
adenomatous polyposis (APC); 5) Neurofibromatosis type 1 (NF1); 6) Neurofibromatosis type 2 
(NF2); 7) von Hippel-Lindau syndrome (VHL); and 8) Multiple endocrine neoplasia type 2A 
(MEN2A). 

Tvmor suppressor loci that have been mapped genetically but not yet isolated include genes 
for Multiple endocrine neoplasia type 1 (MEK1); Lynch cancer family syndrome 2 (LCFS2); 
Familial breast cancer (BRCA1); Neuroblastoma (NB); Basal cell nevus syndrome (BCNS); 
Beckwith-wledemann syndrome (BWS); Renal cell carcinoma (RCQ; Tuberous sclerosis 1 
(TSC1); and Tuberous sclerosis 2 (TSC2). The tumor suppressor genes that have been charac- 
terized to date encode products with similarities to a variety of protein types, including DNA 
binding proteins (WT1), ancillary transcription regulators (RBI), GTPase activating proteins or 
GAPs (NF1), cytoskeletal components (NF2), membrane bound receptor kinases (MEN2AX and 
others with no obvious similarity to known proteins (APC and VHL). 

In many cases, the tumor suppressor gene originally identified through genetic studies has 
been shown in some sporadic tumors to be lost or mutated. This result suggests that regions of 
chromosomal aberration may signify the position of important tumor suppressor genes involved 
both in genetic-predisposition to cancer and in sporadic cancer. - — — 
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One of the hallmarks of several tumor suppressor genes characterized to date is that they are 
deleted at high frequency in certain tumor types. The deletions often involve loss of a single allele, 
a so-called loss of heterozygosity ' (LOH), but may also involve homozygous deletion of both 
alleles. For LOH, the remaining allele is presumed to be nonfunctional, either because of a 
preexisting inherited mutation, or because of a secondary sporadic mutation. 

Melanoma is a common cancer afflicting one in every hundred Americans (American Cancer 

Society, 1992). Environmental inflvrr"y« > such as exposure to ultraviolet light, play a large role in 
melanoma incidence, but heredity is also a contributing factor. A gene for familial melanom a, 
MLM, has been mapped to chromosome 9p21 (Cannon-Albright et aL, 1992; Nancarrow et at, 
1993; Gruis et ah, 1993; Goldstein et aL, 1994). Possession of a single predisposing allele at the 
MLM locus increases the probability that an individual will develop melanoma by up to 
approximately 50-fold MLM belongs to the growing family of suspected tumor suppressor genes. 
Predisposition to melanoma is inherited as a dominant MendeUan trait, yet predisposing mutations 
in MLM are thought to act as somatic recessive alleles in the manner originally proposed by 
Knudson (1971). In a predisposed individual who carries one wild-type and one mutant MLM 
allele, dividing cells undergo secondary mutational events that involve loss or inactivation of the 
wild-type copy of MLM, thereby uncovering the inherited mutant MLM allele. Conversely, a 
single wild-type copy of the gene prevents the onset of malignancy. 

Chromosomal aberrations In the vicinity of MLM at 9p21 have been extensively 
characterized in several different tumor types, including glioma cell lines, non-small cell lung lines 
and acute lymphoblastic leukemia lines (Olopade et aL, 1 992; Olopede et aL, 1993; Lukeis et aL, 
1990; Diaz etaL, 1988; Nfiddleton et aL, 1991; Fountain etaL, 1992; Cheng ctaL, 1993; James et 
aL, 1993). Thus, based on the frequency of 9p21 chromosomal abnormalities in non-melanoma 
tumor cells, it is probable the MLM region contains a gene (or genes) that participates at lead in 
the progression of several different tumor types. These events involve LOH as well as a high 
frequency of homozygous deletion. 

Cells in tissues have only three serious options in life - they can grow and divide, not grow 
but stay alive, or die by apoptosis. Tumors may arise either by inappropriate growth and division 
or by cells foiling to die when they should One of the mechanisms for controlling tumor growth 
might involve direct regulation of the cell cycle. For example, genes that control the decision to 
initiate DN A* replication are attractive candidates for oncogenes or tumor suppresso^gmes, 
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dcpcnding on whether they have a stimulatory or inhibitory role in the process. Progression of 
eukaryotic cells through the cell cycle (G„ S f G 2 and M phases) is governed by the sequential 
formation, activation and subsequent inactivation of a series of cyclin/cyclin-<iependent kinase 
(Cdk) complexes. Cyclin D's/Cdk2,4,5, Cyclin E/Cdk2, Cyclin A/Cdk2 and Cyclin B/A/Cdk2 
have been shown to be involved in this process. Cyclin D's and Cdk2, Cdk4 and Cdk5 have been 
implicated in the transition from G 4 to S; that is, when cells grow and decide whether to begin 
DNA replication. Additional cell cycle control elements have recently been discovered. These 
elements are inhibitors of Cdks (Cdk inhibitors, CklX and include Farl, p21. p40, p20 and pl6. 
(Mane, 1994; Nasmyth & Hunt, 1993). 

Recently, several oncogenes and tumor suppressor genes have been found to participate 
direcdy in the cell cycle. For example, one of the cyclins (proteins that promote DNA replication) 
has been implicated as an oncogene (Motokura et aL, 1991; Lammie et aL, 1991; Withers et aL, 
1991; Rosenberg et aL, 1991), and tumor suppressor Rb interacts with die primary cyclin-binding 
partners, the Cdks (Ewen et al., 1993). Identification of a melanoma susceptibility locus would 
open the way for genetic screening of individuals to assess, for example, the increased risk of 
cancer due to sunlight exposure. The MTS may also predispose to a large number of other cancer 
sites, including but not limited to, leukemia, astrocytoma, glioblastoma, lymphoma, glioma, 
Hodgkin's lymphoma, multiple myeloma, sarcoma, myosarcoma, cholangiocarcmoma, squamous 
cell carcinoma, CLL, and cancers of the pancreas, breast, brcn, prostate, bladder, thyroid, ovary, 
uterus, testis, kidney, stomach, colon and rectum. In addition, since MTS influences progression of 
several different tumor types, it should be useful for determining prognosis in cancer patients. 
Thus, MTS may serve as the basis for development of very important diagnostic tests, one capable 
of predicting the predisposition to cancer, such as melanoma, ocular me la nom a, leukemia, astrocy- 
toma, glioblastoma, lymphoma, glioma, Hodgkin's lymphoma, multiple myeloma, sarcoma, 
myosarcoma, cholangiocarcinoma, squamous cell carcinoma, CLL, and cancers of the pancreas, 
breast, brain, prostate, bladder, thyroid, ovary, uterus, testis, kidney, stomach, colon and rectum, 
and one capable of predicting the prognosis of cancer. Furthermore, since MTS is involved in the 
progression of multiple tumor types, MTS may provide the means, either directly or indirectly, for 
a general anti-cancer therapy by virtue of its ability to suppress tumor growth. For example, 
restoration of the normal MTS function to a tumor cell may transmute the cell .into non- 
malignancy. — ' . . — — ^— 
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<CT IMVf ARV OF THF IMVFNTION 

The present invention relates to somatic mutations in the Multiple Tumor Suppressor (MIS) 
•■ gene in human cancers and their use in the diagnosis and prognosis of human cancer* Hie 

5 invention further relates to gennline mutations in the MTS gene and their use in the diagnosis of 
predisposition to many cancers, such as melanoma, ocular melanoma, leukemia, astrocytoma, 
glioblastoma, lymphoma, glioma, Hodgkin's lymphoma, multiple myeloma, sarcoma, 
myosarcoma, cholangiocardnoma, squamous cell carcinoma, CLL, and cancers of the pancreas, 
breast, brain, prostate, bladder, thyroid, ovary, uterus, testis, kidney, stomach, colon and rectum. 
10 Hie invention also relates to the therapy of human cancers which have a mutation in the MTS 
gene, including gene therapy, protein replacement therapy and protein mimetics. Finally, the 
invention relates to the screening of drugs for cancer therapy. 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 

Figure 1A shows Kindred 3137. All melanoma cases carry the susceptible haplotype. Other 
cancers in individuals carrying the susceptible haplotypes are also shown. The legends art as 
follows: filled circle or square indicates melanoma; partially filled circle or square indicates other 
cancer, T indicates deceased; "•" indicates that individual is unknown with regard to Misceptible 

20 haplotype; indicates that individual appears to cany susceptible haplotype; and "35" indicates 
age of ***** or dia gnos is if a ffec t e d 

Figure IB shows Kindred 3161. All melanoma cases carry the susceptible haplotype. Other 
cancers have not been haplotyped. The legends are as follows: filled circle or square indicates 
melanoma; partially filled circle or square indicates other cancer, T indicates deceas ed ; "=" 

25 indicates appears elsewhere in kindred; "3" in pe n tagon indicates multiple marriage; and "35" 
indicates age of exam or diagnosis if affected. 

Figure 1C shows Kindred 3355. All melanoma cases cany the susceptible haplotype. Other 
cancers have not been haplotyped. The legends are as follows: filled circle or square indicates 
melanoma; partially filled circle or square indicates other cancer, T indicates deceased; and "35" 

30 indicates age of exam or diagnosis if affected. 
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Figurt ID shows Kindred 1771 and the occurrence of melanoma and other cancers. A 
mutation was identified in MTS in this kindred The legends are as follows: an "•" indicates a 
confirmed mutation carrier, filled circle or square indicates melanoma; partially filled circle or 
square indicates other cancer (colon in this kindred); V indicates deceased; and "35" indicates age 
of exam or diagnosis if affected- 
Figure 2 shows YAC and PI clones in the region bounded by IFNA-s and D9S171. The 
centromere is to the right For PI clones, die arrow points in the direction of the T7 promoter 
sequence in the vector. YACs that are grouped together represent clones that are similar based on 
mapping STSs in the region. These YACs are presumed not to be identical. YACs A5,B11, C6 
and F9 contain IFN-1 and IFN-s. YACs Dl, FS and E3 contain D9S126 and D9S171 . Neither the 
proximal ends of YACs that include D9S171 nor the distal ends of YACs that include IFNA-s are 
shown. Distances are not necessarily drawn to scale. The markers internal to IFNA-s and D9S1 71 
are depicted in Figure 2. Markers that begin with "c" are derived from cosmid end sequences. The 
cosmids are not shown. The distances between cl.b and c53 and between 760-L and D9S171 are 
unknown. 

Figure 3 shows a diagram of deletions observed in melanoma cell lines. The deletions fell 
into 12 classes, based on the set of markers which are deleted. Eleven cell lines lacked all markers 
depicted in the figure. This class is not shown. Hie number of re p resenta tives of each of the 12 
other classes is shown in the column labeled "# lines." Locations of the deletion breakpoints for 
classes 1-10 are portrayed as felling at the marker adjacent to the deleted DNA; that is, the last 
positive marker in the series leading up to the deletion. For classes 11 and 12, the sites of deletions 
are shown by filled triangles. 

Figure 4A shows a map of cosmid cS. Relevant STSs used for the deletion analysis are 
shown, as are cosmids and Pis. The cl.b marker lies proximal to Pl-1062 and is not shown. The 
transcriptional orientations of MTS1 and MTS2 are shown by arrows. 

Figure 4B shows a restriction map and STS map of cosmid cS. Positions of coding exons for 
MTS1 and MTS2 are shown as thick bars. "El" and "E2" mean "coding exon 1" and "coding exon 
2," respectively. "B" is BamK, "S" is Sail, "Rl" is EcoRI and "R5" is EcoRV. 

Figures 5A and 5B show a comparison of the genomic sequence containing a 5* untranslated 
region, exon l f and part of intron 1 forMTSl with the published sequence for pl6 (Serrano etal., 
1993). The start codon (underlined) is located at position 891 and a splice site-(anow) at paction 
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1016. The MTS1 sequence shown in Figures 5A-B is SEQ ID NO:3. The pl6 sequence shown in 
Figure 5B is SEQ ID N024. 

Figures 6A and 6B show a comparison of the genomic sequence containing part of intron 1, 
exon 2 and part ofintron 2 for MTSl.wim unpublished sequence for pi 6 (Senanoetal^ 1993). 

5 Splice sites (arrows) are located before position 192 and after position 498. The MTS1 sequence 
shown in Figures 6A-B is SEQ ID NO:4. The pl6 sequence shown in Figure 6A is identical to 
nucleotides 192-498 of SEQ ID NO:4. 

Figures 7A and 7B show a comparison of the genomic sequence containing part of intron 1, 
■exon,V and follow-through sequences for MTS2 with the published pi 6 sequence. The "Exon 2" 

1 o sequence is similar to exon 2 of MTS 1 from nucleotides 273 to 580. The splice site in MTS2, and 
those in pl6, are shown by anows. The point where divergence begins is indicated by °. The 
tennination codon for MTS2 is present in exon 2 at position 532 and is indicated byan u * w . The 
MIS2 sequence shown in Figures 7A-B is SEQ ID NO:5. The pl6 sequence shown in Figure 7A 
is identical to nucleotides 192-498 of SEQ 1DN0:4. 

15 Figure 8 shows a comparison of the MTS1 and MTS2 DNA sequences including exon 2 and 

part of each surrounding intron. The positions of the 3' spUce junction of intron 1 and the 5* splice 
junction of intron 2 for MTS1 are shown by triangles. The divergence point near the 3' end of 
eroding exon 2 is indicated by an arrow. The MTS1 sequence shown corresponds to nucleotides 
92-548 c'SEQ ID NO:4. The MTS2 sequence shown corresponds to nucleotides 174430 of SEQ 

20 IDNO:5. 

Figure 9 shows deletions in tumor cell lines of various STSs. Positive controls and negative 
controls were included in every PCR experiment and cell lines in which only one or two of the 
STSs were deleted (e.g., class 21) were retested at least twice. 

figures 10A-C show expression ofMTS2 mRNA Figure 10A shows the relative level of 

25 MTS2 transcript in RNA (Clonetech) derived from various human tissues: lane 1-brain; lane 2- 
breast; lane 3-kidney; lane 4-lung; lane Myinphocyte; lane 6-ovary; lane 7-pancreas; lane 8- 
prostate; lane 9-spleen; lane 10-stomach; lane 11 -thymus. The origins of products with Afferent 
than expected molecular weights (see lane 1) are unknown. Figure 10B shows the relative MTS2 
transcript level in human lymphocytes as a function of time after mitogenic induction: lane 1 - 0 

3 0 hours; lane 2 - 1 hour, lane 3 - 2 hours; lane 4 - 4 hours; lane 5 - 8 hours; lane 6 - 1 6 hours; Jane 7 - 
24 hours; lane 8 - 32 hours; lane 9 - 40 hours; lane 10 - 48 hors; lane 1 1 ♦ 56 hours; lan^J2^..64 
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houis. A majority of the cells were in S phase 40-50 hours after induction. Figure 10C shows 
MTS2 transcript level as a function of Rb status. The Kb* cell lines are: Lane 1 - KIT (Hon et al., 
1987); lane 2 - diploid human fibroblast MRC5, passage 28; lane 3 - UMSCC2; lane 4 - Bristol 8; 
lane 5 - ZR75; lane 6 - HaCaT; lane 7 - T24. The Rb - cell lines are: lane 8 - MDA MB 468; lane 9 
. 5637; lane 10 - C33A; lane 1 1 - SiHa; lane 12 - CaSki; lane 13 - WERL 

Figure 1 1 shows the cDNA sequence (and the encoded polypeptide) for MTS2 including 5'- 
untranslated region The beginning of exon 2 is located at position 491 and is indicated by an 
arrow. The cDNA sequence is shown as SEQ ID NO:15 and the amino acid sequence is shown as 
SEQIDNO:16. 

Figures 12A and 12B show the cDNA sequence (and the encoded polypeptide) of 
MTSlEip. Splice sites are indicated by arrows. Exon 2 begins at position 335 and exon 3 begins 
at position 642. The cDNA sequence is shown as SEQ ID NO: 13 and the amino acid sequence is 
shownbySEQIDNO:14. 

Figure 13 is a physical map of the P16 region. The positions of exon la (Ela), exon IB 
(El0),exon 2 (E2) and exon 3 (E3) are indicated by the filled boxes. The positions of restriction 
sites Eco Rl (Rl), Eco RV (RV), and Sal I (S) are indicated. Above the restriction map are 
genomic clones cosmid c5 and PI 1063. Below the map are the deletions in cell lines A375 and 
SK-mel 93. The dashed line represents deleted DNA. 

Figure 14 shows the aligmrent between mouse and human P16 0 transcript sequences. 
Capital letters indicate identical nucleotides. The stop codons in the pl6 reading fiame are 
underlined The splice junction between E1B and E2 is indicated whh a caret (v). The mouse 0 
sequence is shown as SEQ ID N025. The human 0 sequence is identical to nucleotides 193-461 
ofSEQIDNO:l3. 

Figure 15 shows the expression of the a transcript in cell lines that contain deletions of El (J. 
cDNA was derived from total RN A isolated from the Indicated samples. A radio-labeled primer 
was included in the reactions to amplify the P16 transcripts. Equal volumes of the a end 0 
amplifications were mixed, and the products were resolved on a denaturing 5% polyacrylarnide 
gel: lane 1 - quiescent T cells; lane 2 - cell line SK-mel 93; lane 3 - cell line A375. 

Figures 16A-D show the expression ofP16 transcripts. A radio-labeled primer was included 
in the reactions to amplify the P16 transcripts and the products were resolved on a denaturing 5% 
poryacrylamiae'geL In Figures 16A and 16D the o and 0 reactions from a common sainpje-were 



WO 9S/2S813 



PCT/US9S/03537 



-9- 



mixed prior to electrophoresis. Figure 16A shows the relative levels of P16 transcripts in RNA 
derived from various" human tissues: lane 1, brain; lane 2, breast; lane 3, kidney; lane 4, lung; lane 
5, lymphocyte; lane 6, ovary, lane 7, pancreas; lane 8, prostate; lane 9, spleen; lane 10, stomach; 
lane 1 1, thymus. Figure 16B shows the relative amount of the p transcript in human lymphocytes 
5 as a function oftimeafuimitogenic induction; lane 1,0 hours; lane 2, 1 hour, lane 3, 2 hours; lane 
4, 4 hours; lane 5, 8 hours; lane 6, 16 bourn; lane 7, 24 noun; lane 8, 32 hours; lane 9, 40 hours; 
lane 10, 48 hours; lane 1 1, 56 hours; lane 12, 64 hours. Figure 16C shows the relative amount of 
the a transcript in human lymphocytes as a function of time after mitogenic induction: Lanes, 
same as in Figure 16B, but the 1 hour time point was omitted, The expression of other molecules 
10 that are either suspected to influence cell-cycle progression or that are regulated at the 
transcriptional level during the cell-cycle was also analyzed. In agreement with previous results, 
levels of CDK4 and GoS 2 (a molecule of unknown function, but whose transcription is induced 
when quiescent T cells enter the cell cycle) increased upon T cell induction (Russell and Forsdyke, 

1991; Matsushirae et al„ 1992; Geng and Weinberg, 1993). In contrast, the RNA levels of p27 
L5 appeared unchanged during the course of the experiment (Toyoshima and Hunter, 1994; Kato et 

aL, 1994). Figure 16D shows P16 transcripts as a function of Rb status. Rb" cell lines: lane 1. 

WERfclane2,CaSki;lane3,SiHa;lane4,C33A^ Rb*cell 

lines: lane 7, T24; lane 8, HaCaT; lane 9, Zr75; lane 10, Bristol 8; lane 11, UMSCC2; lane 12. 

diploid human fibroblast MRC5, passage 28; lane 13, KIT (H< ;.i et aL, 1987). 
2 0 Figure 17 shows the cDNA sequence for MTSl including noncoding portions of the cDNA. 

The trian gl es indicate splice junctions. The dashes in the sequence at the second splice junction 

only emphasize this splice junction, they do not indicate missing bases. This sequence is SEQ ID 

N036. 

25 nETATT.Fn DES fPTPTtniJ OF THE TNVFNTiON 

The present invention relates to somatic mutations in the Multiple Tumor Suppressor (MTS) 
gene in h uman cancers and their use in the diagnosis and prognosis of human cancer. The 
invention further relates to germ line mutations in the MTS gene and their use in the diagnosis of 
30 predisposition to various cancers, such as melanoma, ocular melanoma, leukemia,-' astrocytoma, 
glioblastoma," -lymphoma, glioma, Hodgkin's lymphoma, multiple- myeloma, sajcoma, 
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myosarcoma, choiangiocarcinoma, squamous cell carcinoma, CLL, and cancers of the pancreas, 
breast, brain, prostate, bladder, thyroid, ovary, uterus, testis, kidney, stomach, colon and rectum. 
The invention also relates to the therapy of human cancers which have a mutation in the MTS 
gene, including gene therapy, protein replacement therapy and protein mimetics. Finally, the 
invention relates to the screening of drogs for cancer therapy. 

The present invention provides an isolated polynucleotide comprising all, or a portion of the 
MTS locus or of a mutated MTS locus, preferably at least eight bases and not more than about 1 00 
Kb in length. Such polynucleotides may be antisense polynucleotides. The present invention also 
provides a recombinant construct comprising such an isolated polynucleotide, for example, a 
recomb inan t construct suitable for expression in a transformed host cell 

Also provided by the present invention are methods of detecting a polynucleotide comprising 
a portion of the MTS locus or its expression product in an analyte. Such method may further 
comprise the step of amplifying the portion of the MTS locus, and may fiirtfaer include a step of 
providing a set of polynucleotides which are primers for amplification of said portion of the MTS 
locus. The method is useful for either diagnosis of the predisposition to cancer or the diagnosis or 
prognosis of cancer. 

The present invention also provides isolated antibodies, preferably monoclonal antibodies, 
which specifically bind to an isolated polypeptide comprised of at least five amino acid residues 
encoded by the MTS locus. 

The present invention also provides lots for detecting in an analyte a polynucleotide 
comprising a portion of the MTS locus, the kits comprising a polynucleotide complementary to the 
portion of the MTS locus packaged in a suitable container, and instructions for hs use. 

The present invention further provides methods of preparing a polynucleotide comprising 
polymerizing nucleotides to yield a sequence comprised of at least eight consecutive nucleotides of 
the MTS locus; and methods of preparing a polypeptide comprising polymerizing amino acids to 
yield a sequence comprising at least five amino acids encoded within the MTS locus. 

In addition, the present invention provides methods of screening drugs for cancer therapy to 
identify suitable drugs for restoring MTS gene product function. 

Finally, the present invention provides the means necessary for production of gene-based 
therapies directed at cancer cells. These therapeutic agents may take the form of polytoiffileotides 
comprising all of a portion of the MTS locus placed in appropriate vectors or delivered xcrgEgi ; 
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cells in more direct ways such that the function of the MTS protein is reconstituted Therapeutic 
agents may also take the form of polypeptides based on either a portion o£ or the entire protein 
sequence of MTS. These may functionally replace the activity of MTS in vivo. 

It is a discovery of the present invention that die MTS locus (referr e d to in the prior ait as 
5 Melanoma (MLM) locus), which predisposes individuals to melanoma and other cancers, is a gene 
encoding MTS1, which has been found to be an inhibitor of Cdks, particularly Cdk4. This gene is 
termed MTS 1 herein. It is also a discovery of the present invention that the MTS locus contains a 
second coding sequence, termed MTS2, which is veiy similar to MTS1 over part of its sequence. It 
is also a discovery of the present invention that the MTS 1 gene has two separate promoters - a and 

10 p. When the a promoter is used the resulting mRNA is composed of exon la, exon 2 and exon 3. 
This is referred to as MTS1. When the 0 promoter is used the resulting mRNA is composed of 
. exon exon 2 and exon 3. This is referred to as MTS1E10. It is a discovery of die present 
invention that mutations in the MTS locus in the gennline are indicative of a predisposition to 
melanoma and to other cancers. Finally, it is a discovery of the present invention that somatic 

15 mutations in the MTS locus are associated with most, if not all tumor types, and thus repres ent a 
general indicator of cancer or of prognosis of cancer. The mutational events of the MTS locus can 
involve deletions, insertions and point mutations within the coding sequence and the non-coding 
sequence. 

The MLM locus was first located genetically by showing dramatic linkage in several Utah 
20 kindreds and one Texas kindred between genetic markers and melanoma predisposition (Cannon- 
Albright, 1992). The region defined by recombinants in the kindreds is flanked by D9S736 and 
D9S171. Subsequently, these and other genetic markers wot used to localize the gene by analysis 
of homozygous deletions in both melanoma and non-melanoma tumor cell lines containing 
deletions. The mmttmim area of overlap of the deletions was flanked by IFNA-s and D9S171. 
25 YAC libraries were screened to identify genomic clones surrounding these markers. PI clones 

» 

were isolated as part of a chromosomal walk and were contiguous from IFNA-s to D9S171 except 
for two gaps. Specific sequence-tagged sites ("STS") were prepared to construct a more detailed 
molecular map. Using these markers and a deletion analysis, a region of deletion overlap centered 
around markers cS.l and cS3 f markers found on cosmid 5 (c5). The most frequently deleted 
30 marker was c5.3, which thus was very close to MTS. ^ .. 
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An analysis of cS for the presence of "CpG" islands showed that it contained at least one 
candidate gene for MTSL DNA sequences of EcoRI fragments of c5 were determined and 
compared against sequences from GenBank. Two distinct regions of c5 were identified that were 
similar to a region of a previously identified gene encoding human Cdk4 inhibitor, or pl6 (Serrano 
5 etal., 1993). These two candidate genes are called MTS1 and MTS2. Screening cDNA libraries 
of lymphocyte, fetal brain and normal breast with a probe from Exon 2 of MTS1 identified an 
additional candidate called MTSlElp. 

A detailed comparison of the genomic sequence from c5 with the pl6 mRNA seque n ce 
revealed that MTS 1 contained a stretch of 307 bp that was identical to a portion of die p 1 6 coding 

10 sequence. This stretch of nucleotides in MTS1 was flanked by recognizable splice junction 
sequences. Further characterization of MTS1 showed that ft included the entire coding sequence of 
pl6 plus two introns. Intron I was located 126 bp downstream from the translational start site; 
Intron 2 was located 11 bp upstream from the translational stop she. The two introhs divided the 
coding sequence of pl6 into three regions, a ? region of 126 bp (coding Exon IX a middle region 

15 of 307 bp (coding Exon 2), and a 3 f region of 11 bp (coding Exon 3). 

MTS2 contained a region of DNA sequences nearly identical to pl6 that extended from the ' 
S end of coding Exon 2 roughly 200 bp toward intron 2. However, the sequence similarity 
decreased to a point 51 bp upstream of Intron 2 in MTS1, where the two sequences diverged 
complet ;ty. This corresponds to the location of the final codon of MTS2. Comparison of sequences 

20 from MTS 1 and MTS2 showed that the sequence similarity between these two genes also extended 
nearly 50 nucleotides upstream from the 3 9 splice junction of intron 1. Thus, portions of noncoding 
DNA were more conserved than some areas of presumptive coding DNA. To exclude the 
possibility that the sequence divergence in coding DNA might be a cloning artifact, PCR primers 
were designed to amplify specifically across the sequence divergence point of MTS2. These 

25 primers amplified a fragment of the predicted size from cosmid PI and genomic DNA. Therefore, 
the divergent sequence located near the 3' end of Exon 2 in MTS2 is a bona fide genomic 
sequence. - 

MTSlEip contains an Exon 1, called Exon 1(3 or Eif3, which has a different sequence than 
found in Exon 1 of MTS1 and MTS2. MTS1E10 also contains Exon 2 (E2) and Exon 3 (E3) 

30 which are identical to Exons 2 and 3 of MTS1. Exon lp is located upstream of Exoir i-of MTS1 
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and does not contain any coding sequence. As a result MTSiEljJ encodes a plO which has a 
translation start she at the first ATG of Exon 2. 

MTS1 and MTS2 were tested for correspondence with the genetic susceptibility locus MTS 
by analyzing genomic DNA, using Exon 2, from individuals presumed to cany MLM predisposing 
5 alleles. DNA polymorphisms were identified in Exon 2 of MTS 1 in one of eight individuals* The 
mutation was a single nucleotide substitution, resulting in an amino acid change. This polymor- 
phism segregated with the MLM predisposing allele. 

The preponderance of lesions in MTS I (deletions and nucleotide substitutions) indicates that 
MTS1 or a closely linked locus contributes to the tumor phenotype. Cells that suffer these lesions 

10 enjoy a selective advantage over cells that do not The alternative explanation, that the lesions are 
random events having nothing to do with cell growth, is unlikely for several reasons. First, the 
high correlation between tumor phenotype and mutation at MTS1 implies a causal relation between 
MTS1 mutations and tumor formation. Second, MTS1 influences susceptibility to melanoma, and 
thus is implicated independently as a tumor suppressor gene. Third, the biochemical function of 

IS p!6 as a potent inhibitor of a Cdk neatly fits a model where MTS1 acts in vivo as a general 
inhibitor of the onset of DNA replication. 

According to the diagnostic and prognostic method of the present invention, alteration of the 
wild-type MTS locus is detected In addition, the method can be performed by detecting the wild- 
type MTS locus and confirming tL* lack of a predisposition or neoplasia. "Alteration of a wild- 

20 type gene" encompasses all forms of mutations including deletions, insertions and point mutations 
in the coding and noncodtng regions. Deletions may be of the entire gene or only a portion of the 
gene. Point mutations may result in stop codons, frameshift mutations or amino acid substitutions. 
Somatic mutations are those which occur only in certain tissues, e.g^ in the tumor tissue, and are 
not inherited in die germline. Gennline mutations can be found in any of a body's tissues and are 

25 inherited. If only a single allele is somatically mutated, an early neoplastic state is indicated. 
However, if both alleles are mutated, then a late neoplastic state is indicated. The finding of MTS 
mutations thus provides both diagnostic and prognostic information. An MTS allele which is not 
deleted (e.g., that found on the sister chromosome to a chromosome canying an MTS deletion) can 
be screened for other mutations, such as insertions, small deletions, and point mutations. It is 

30 believed that many mutations found in tumor tissues will be those leading to decreased expression 
of the MTS gene product However, mutations leading to non-functional gene products woukUlso 



WO 95/25813 



PCT/DS9S/03S37 



-14- 

lead to a cancerous state. Point mutational events may occur in regulatory regions, such as in the 
promoter of the gene, leading to loss or diminutton of expression of the mRNA Point mutations 
may also abolish proper RNA processing, leading to loss of expression of the MTS gene product, 
or a decrease in mRNA stability or translation efficiency. 

Useful diagnostic techniques include, but are not limited to fluorescent in situ hybridization 
(FISH), direct DNA sequencing, PFGE analysis, Southern blot analysis, single stranded 
conformation analysis (SSCA), RNase protection assay, allele-specific oligonucleotide (ASO), dot 
blot analysis and PCR-SSCP, as discussed in detail further below. 

Predisposition to cancers, such as melanoma and die other cancers identified herein, can be 
ascertained by testing any tissue of a human for mutations of the MTS gene. For example, a person 
who has inherited a genuine MTS mutation would be prone to develop carom . This can be 
determined by testing DNA from any tissue of the person's body. Most simply, blood can be 
drawn and DNA extracted from the cells of the blood. In addition, prenatal diagnosis' can be 
accomplished by testing fetal cells, placental cells or amniotic fluid for mutations of die MTS gene. 
Alteration of a wild-type MTS allele, whether, for example, by point mutation or by deletion, can 
be detected by any of the means discussed herein. 

In order to detect the alteration of the wild-type MTS gene in a tissue, it is helpful to isolate 
the tissue free from surrounding normal tissues. Means for enriching a tissue preparation for tumor 
cells are known in the ait. For example, the tissue may be Isolated from paraffin or cryostat 
sections. Cancer cells may also be separated from normal cells by flow cytometry. These 
techniques, as well as other techniques for separating tumor cells from normal cells, are well 
known in the art If the tumor tissue is highly contaminated with normal cells, detection of 
mutations is more difficult 

A rapid preliminary analysts to detect polymorphisms in DNA sequences can be performed 
by looking at a series of Southern blots of DNA cut with one or more restriction enzymes, 
preferably a large number of restriction enzymes. Each blot contains a series of normal individuals 
and a series of cancer cases, tumors, or both. Southern blots displaying hybridizing fragments 
(differing in length from control DNA when probed with sequences near or including the MTS 
locus) indicate a possible mutation. If restriction enzymes which produce very large restriction 
fragments are used, then pulsed field gel electrophoresis ("PFGE") is employed. . 
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Detection of point mutations may be accomplished by molecular cloning of the MTS 
allele(s) and sequencing that alkle(s) using techniques well known in the ait Alternatively, the 
gene sequences can be amplified; using known techniques, directly from a genomic DNA 
preparation from the tumor tissue. The DNA sequence of the amplified sequences can then be 
5 determined. 

There are six well known methods for a more complete, yet still indirect, test for confirming 
the presence of a susceptibility allele: 1) single stranded conformation analysis f SSCA") (Orita et 
aL, 1989); 2) denaturing gradient gel electrophoresis ("DOGE") (Wartell et aL, 1990; Sheffield et 
aL t 1989); 3) RNase protection assays (Finkelstein et aL, 1990; Kinszler et aL, 1991); 4) allele- 

10 specific oligonucleotides fASOs") (Conner et aL, 1983); 5) the use of proteins which recognize 
nucleotide mismatches, such as the £ coli mutS protein (Modrich, 1991); and, 6) allele-specific 
PCR (Rano & Kidd, 1989). For allele-spedfic PCR, primers are used which hybridize at their 3 9 
ends to a particular MTS mutation. If the particular MTS mutation is not present, an amplification 
product is not observed. Amplification Refractory Mutation System (ARMS) can also be used, as 

IS disclosed in European Patent Application Publication No. 0332435 and in Newton et aL, 1989. 
Insertions and deletions of genes can also be detected by cloning, sequencing and amplification. In 
addition, restriction fragment length polymorphism (RFLP) probes for the gene or surrounding 
marker genes can be used to score alteration of an allele or an insertion in a polymorphic fragment 
Such a method is particularly useful for screening relatives of an affected individual for the 

20 presence of the MTS mutation found in that individuaL Other techniques for detecting insertions 
and deletions as known in die art can be used. 

In the first three methods (Le^ SSCA, DOGE and RNase protection assay), a new 
electrophoretic band appears. SSCA detects a band which migrates differentially because the 
sequence change causes a difference in single-strand, intramolecular base pairing. RNase 

25 protection involves cleavage of the mutant polynucleotide into two or more smaller fragments. 
DOGE detects differences b migration rates of mutant se q uen c es compared to wild-type 
sequences, using a denaturing gradient gel. In an allele-sperific oligonucleotide assay, an 
oligonucleotide is designed which detects a specific sequence, and the assay is performed by 
detecting the presence or absence of a hybridization signal. In the mutS assay, the protein binds 

3 0 only to sequences that contain a nucleotide mismatch in a heteroduplex between mufanttnd wild- 
type sequences - — — — ^_ ■ - u • - 
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Misraatches, according to the present invention, are hybridized nucleic acid duplexes in 
which the two strands are not 100% complementary. Lack of total homology may be due to 
deletions, insertions, inversions or substitutions. Mismatch detection can be used to detect point 
mutations in the gene or its mRNA product While these techniques are less sensitive than 
5 sequencing, they are simpler to perform on a large number of tumor samples. An example of a 
mismatch cleavage technique is the RNase protection method. In the practice of the present 
invention, the method involves the use of a labeled riboprobe which is complementary to the 
human wild-type MTS gene coding sequence. Tie riboprobe and either mRNA or DNA isolated 
'from the tumor tissue axe annealed (hybridized) together and subsequently digested with the 

10 enzyme RNase A which is able to detect some mismntrhrs in a duplex RNA structure. If a 
mismatch is detected by RNase A, it cleaves at the site of the mismatch. Thus, when the annealed 
RNA preparation is separated on an electrophoretic gel matrix, if a mismatch has been detected and 
cleaved by RNase A, an RNA product will be seen which is smaller than the full length duplex 
RNA for the riboprobe and the mRNA or DNA. The riboprobe need not be the full length of die 

15 MTS mRNA or gene but can be a segment of either. If the riboprobe comprises only a segment of 
the MTS mRNA or gene, it will be desirable to use a number of these probes to screen the whole 
mRNA sequence for mismatches. 

In similar fashion, DNA probes can be used to detect mismatches, through enzymatic or 
chemical cleavage. See, eg., Cotton et al„ 1988; Shenk et al., 1975; Novack et aL, 1986. 

2 0 Alternatively, mismatches can be detected by shifts in the electrophoretic mobility of mismatched 

duplexes relative to matched duplexes. See, e.gn Cariello, 1988. With either riboprobes or DNA 
probes, the cellular mRNA or DNA which might contain a mutation can be amplified using PCR 
(see below) before hybridization. Changes in DNA of the MTS gene can also be detected, using 
Southern hybridization, especially if the changes are gross rearrangements, such as deletions and 
25 insertions. 

DNA sequences of the MTS gene which have been amplified by use of PCR may also be 
screened using allele-specific probes. These probes are nucleic acid oligomers, each of which 
contains a region of the MTS gene sequence harboring a known mutation. For example, one 
oligomer may be about.30 nucleotides in length, corresponding to a portion of the MTS gene 

3 0 sequence. By use of a battery of such allele-specific probes, PCR amplification products can be 

screened to identify the presence of a previously identified mutation in the- MT%^g$e* 
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Hybridization of allele-specific probes with amplified MTS sequences can be performed, for 
example, on a nylon filter. Hybridization to a particular probe under stringent hybridization 
conditions indicates the presence of the same mutation in the tumor tissue as in the allele-spccific 
probe. 

The most definitive test for mutations in a candidate locus is to directly compare genomic 
MTS sequences from cancer patients with those from a control population. Alternatively, one 
could sequence messenger RNA 'after amplification, e.g^ by PCR, thereby eliminating the 
necessity of determining the exon structure of the candidate gene. 

Mutations from cancer patients falling outside the coding region of MTS can be detected by 
examining the non-coding regions, such as introns and regulatory sequences near or within the 
MTS gene. An early indication that mutations in noncoding regions are important may come from 
Northern blot experiments that reveal messenger RNA molecules of abnormal size or abundance in 
cancer patients as c ompare d to control individuals. 

Alteration of MTS mRNA expression can be detected by any techniques known in the an 
These include Northern blot analysis, PCR amplification and RNase protection. Diminished 
mRNA expression indicates an alteration of the wild-type MTS gene. Alteration of wild-type MTS 
genes can also be detected by screening for alteration of wild-type MTS protein. For example, 
monoclonal antibodies immunoreactivc with MTS can be used to screen a tissue. Lack of cognate 
antigen would indicate an MTS mutation. Antibodies specific for products of mutant alleles could 
also be used to detect mutant MTS gene product Such immunological assays can be done in any 
convenient formats known in the art These include Western blots, immunohistochemical assays 
and ELISA assays. Any means for detecting an altered MTS protein can be used to detect 
alteration of wild-type MTS genes. Functional assays, such as protein binding deter m i n a tion s, can 
be used. For example, it is known that MTS protein binds to Cdks, especially Cdk4. Thus, an 
assay for the ability to bind to wild-type MTS protein or Cdk4 can be employed. In addition, 
assays can be used which detect MTS biochemical function, the inhibition of Cdks, such as Cdk4, 
and regulation of the cell cycle. Finding a mutant MTS gene product indicates alteration of a wild- 
type MTS gene. 

Mutant MTS genes or gene products can also be detected in other human body samples, such 
as serum, stool, urine and sputum. The same techniques discussed above for detection^ mutant 
MTS genes or- gene products in tissues can be applied to other body samples. Cancer ^gelli are 
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sloughed off from tumors and appear in such body samples. In addition, the MTS gene product 
itself may be secreted into the extracellular space and found in these body samples even in the 
absence of cancer cells. By screening such body samples, a simple early diagnosis can be achieved 
for many types of cancers. In addition, the progress of chemotherapy or radiotherapy can be 
monitored more easily by testing such body samples for mutant MTS genes or gene products. 

The methods of diagnosis of the present invention are applicable to any tumor in which MTS 
has a role in tumorigenesis. Deletions of chromosome arm 9p or somatic mutations within the 
MTS region have been observed in almost all tumors ~»ininH The diagnostic method of the 
present invention is useful for clinicians, so they can decide upon an appropriate course of 
treatment 

The primer pairs of die present invention are useful for determination of die nucleotide 
sequence of a particular MTS allele using the PCR. The pain of single-stranded DNA primers can 
be annealed to sequences within or surrounding the MTS gene on chromosome 9p in order to 
prime amplifying DNA synthesis of the MTS gene itself! A complete set of these primers allows 
synthesis of all of die nucleotides of the MTS gene coding irmrCT, Lc„ the exons. The set of 
primers preferably allows synthesis of both intron and exon sequences. Allele-specific primers can 
also be used. Such primers anneal only to particular MTS mutant alleles, and thus will only 
amplify a product in the presence of the mutant allele as a template. 

In order to facilitate subsequent cloning of amplified sequences, primers may have restriction 
enzyme site sequences appended to their S v ends. Thus, all nucleotides of the primers are derived 
from MTS sequences or sequences adjacent to MTS, except for the few nucleotides necessary to 
form a restriction enzyme site. Such enzymes and sites are well known in the art The primers 
themselves can be synthesized using techniques which are well known in die art Generally, the 
primers can be made using oligonucleotide synthesizing machines which are commercially 
available. Given the sequence pf the MTS open reading frames shown in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO*, SEQ ID NO:13, SEQ ID NO:I5 and SEQ ID NO:36, design 
of particular primers is well within the skill of the art 

The nucleic acid probes provided by the present invention are useful for a number of 
purposes. They can be used in Southern hybridization to genomic DNA and in the RNase 
protection method for detecting point mutations already discussed above. The probes canJ>e used 



WO 95/25313 



PCT/US9S/03537 



-19- 

to detect PCR amplification products. They may also be used to detect mismatches with the MTS 
gene or mRNA using other techniques. 

Definitions 

S The present invention employs the following definitions: 

"Amplification of Polynucleotides" utilizes methods such as the polymerase chain reaction 
(PCR), ligation amplification (or ligase chain reaction, LCR) and amplification methods based on 
the use of Q-beta replicase. These methods are well known and widely practiced in the art See, 
e.g^ U.S. Patents 4,683,195 and 4,683,202 and Imris et al. f 1990 (for PCR); and Wu ct aL, 1989a 

10 (for LCR). Reagents and hardware for conducting PCR are commercially available. Primers useful 
to amplify sequences from the MTS region are preferably complementary to, and hybridize 
specifically to sequences in the MTS region or in regions that flank a target region therein. MTS 
sequences generated by amplification may be sequenced directly. Alternatively, but less desirably, 
the amplified sequence^) may be cloned prior to sequence analysis. A method for the direct 

1 5 cloning and sequence analysis of enigmatically amplified genomic segments has been described by 
Scha<1986. 

"Analyte polynucleotide" and "analyte strand" refer to a single- or double-stranded 
polynucleotide which is suspected of containing a target sequence, and which may be present in a 
variety of types of samples, including biological samples. ? 

2 0 "Antibodies." The present invention also provides polyclonal and/or monoclonal antibodies 

and fragments thereof and immunologic binding equivalents thereof, which are capable of 
specifically binding to the MTS polypeptides and fragments thereof or to polynucleotide sequences 
from the MTS region, particularly from the MTS locus or a portion thereof The term "antibody" 
is used both to refer to a homogeneous molecular entity, or a mixture such as a serum product 
25 made up of a plurality of different molecular entities. Polypeptides may be prepared synthetically 
in a peptide synthesizer and coupled to a carrier molecule (e.g^ keyhole limpet hemocyanin) and 
injected over several months into rabbits. Rabbit sera is tested for immunoreactivity to the MTS 
polypeptide or fragment Monoclonal antibodies may be made by injecting mice with the protein 
polypeptides, fusion proteins or fragments thereof. Monoclonal antibodies will be screened by 

3 0 ELIS A and tested for specific immunoreactivity with MTS polypeptide or fiagments-ihereof See, 

Harlow & Lancrl988. These antibodies wfll be useful in assays as well as pharmaceuticals,^—. 
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Once a sufficient quantity of desired polypeptide has been obtained, it may be used for 
various purposes. - A typical use is the production of antibodies specific for binding. These 
antibodies may be either polyclonal or monoclonal, and may be produced by in vitro or in vivo 
techniques well known in the art. 

For production of polyclonal antibodies, an appropriate target immune system, typically 
mouse or rabbit, is selected. Substantially purified antigen is presented to the immune system in a 
fashion determined by methods appropriate for the animal and by other parameters well known to 
immunologists. Typical sites for injection are in footpads, intramuscularly, intraperitoneally, or 
intradennaUy. Of course, other species may be substituted for mouse or rabbit Polyclonal 
antibodies are then purified using techniques known in the art, adjusted for the desired specificity. 

An immunological response is usually assayed with an immunoassay. Normally, such 
immunoassays involve some purification of a source of antigen, for example, that produced by the 
same cells and in the same fashion as the antigen. A variety of immunoassay methods are well 
known in the art. See, eg., Harlow & Lane, 1988, or Goding, 1986. 

Monoclonal antibodies with affinities of 10 4 M" 1 or preferably Iff* to 10"'° M" 1 or stronger 
will typically be made by standard procedures as described, e.g., in Harlow & Lane, 1988 or 
Goding, 1986. Briefly, appropriate animals win be selected and the desired immunization protocol 
followed. After the appropriate period of time, the spleens of such animals are excised and 
individual spleen cells fused, typically, to immortalized myeloma cells under appropriate selection 
conditions. Thereafter, the cells are donally separated and the superaatants of each clone tested for 
their production of an appropriate antibody specific for the desired region of the antigen. 

Other suitable techniques involve in vitro exposure of lymphocytes to the antigenic 
polypeptides, or alternatively, to selection of libraries of antibodies in phage or similar vectors. See 
Huse et al„ 1989. The polypeptides and antibodies of the present invention may be used with or 
without modification. Frequently, polypeptides and antibodies will be labeled by joining, either 
covalently or non-covalerrtly, a substance which provides for a detectable signal A wide variety of 
labels and conjugation techniques are known and are reported extensively in both the scientific and 
patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, 
fluorescent agents, chenuluminescent agents, magnetic particles and the like. Patents teaching the 
use of such labels include U.S. Patents 3,817,837; 3.850,752; 3,939,350; 3,996^45^4^77,437; 
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4,275,149; and 4,366,241. Also, recombinant immunoglobulins may be produced (see U.S. Patent 
4,816,567). 

"Binding partner" refers to a molecule capable of binding a ligand molecule with high 
specificity, as for example, an antigen and an antigen-specific antibody or an enzyme and its 
inhibitor In general the specific binding partners must bind with sufficient affinity to immobilize 
the analyte copy/ complementary strand duplex (in the case of polynucleotide hybridization) under 
the isolation conditions. Specific binding partners are known in the art and include, for example, 
biotin and avidin or streptavidin, IgG and protein A, the numerous, known receptor-ligand couples, 
and complementary polynucleotide strands. In the case of complementary polynucleotide binding 
partners, the partners are normally at least about 15 bases mlengu\ and may be at least 40 bases in 
length. The polynucleotides may be composed ofDNA, RNA, or synthetic nucleotide analogs. 

A "biological sample" refers to a sample of tissue or fluid suspected of containing an 
analyte polynucleotide or polypeptide from an individual including, but not limited to, eg., plasma, 
serum, spinal fluid, lymph fluid, the external sections of the akin, respiratory, intestinal, and genito- 
urinary tracts, tears, saliva, blood cells, tumors, organs, tissue and samples of in vitro cell culture 

COQStltUCfitS* 

As used herein, the terms "diagnosing" or "prognosing," as used in the context of neoplasia, 
are used to indicate 1) the classification of lesions as neoplasia, 2) the detennination of the severity 
of the neoplasia, or 3) the monitoring of the disease progression, prior to, during and after 
txcstmciit* 

"Encode". A polynucleotide is said to "encode" a polypeptide if, in its native state or when 
manipulated by methods well known to those skilled in the art, it can be transcribed and/or 
translated to produce the mRNA for and/or the polypeptide or a fragment thereof The anti-sense 
strand is the complement of such a nucleic acid, and the encoding sequence can be deduced 
therefrom. 

"Isolated" or "substantially pure". An "isolated" or "substantially pure" nucleic acid (e.g, 
an RNA, DNA or a mixed polymer) is one which is substantially separated from other cellular 
components which naturally accompany a native human sequence or protein, e.g., ribosomes, 
polymerases, many other human genome sequences and proteins. The term embraces a nucleic 
acid sequence or protein which has been removed from its naturally occurring environment, and 
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includes recombinant or cloned DNA isolates and chemically synthesized analogs or analogs 
biologically synthesized by heterologous systems. 

"MTS Allele" refers to normal alleles of the MTS locus as well as alleles carrying variations 
thai predispose individuals to develop cancer of many sites including, for example, melanoma, 
5 ocular melanoma, leukemia, astrocytoma, glioblastoma, lymphoma, glioma, Hodgkin's lymphoma, 
multiple myeloma, sarcoma, myosarcoma, cholangiocarcinoma, squamous cell carcinoma, CLL, 
and cancers of the pancreas, breast,' brain, prostate, bladder, thyroid, ovary, uterus, testis, kidney, 
stomach, colon and rectum. Such predisposing alleles are also called "MTS susceptibility 
alleles". 

10 "MTS Locus," "MTS gene," "MTS Nucleic Adda" or "MTS Polynucleotide" refer to 

polynucleotides, all of which art in the MTS region, that are likely to be expressed in normal 
tissue, certain alleles of which predispose an individual to develop melanoma and other cancers, 
such as ocular melanoma, leukemia, astrocytoma, glioblastoma, lymphoma, glioma, Hodgkin's 
lymphoma, multiple myeloma, sarcoma, myosarcoma, cholangiocarcinoma, squamous cell 

15 carcinoma, CLL, and cancers of die pancreas, breast, brain, prostate, bladder, thyroid, ovary, 
uterus, testis, kidney, stomach, colon and rectum. The MTS locus is used interchangeably herein 
with the prior art designation MLM locus, and the use of "MTS" is intended to include "MLM" as 
used with reference to locus, gene, region, and the like* Mutations at the MTS locus may be 
involve 1 in the initiation and/or progression of other types of tumors. The locus is indicated in part 

20 by mutations that predispose individuals to develop cancer. These mutations fell within the MTS 
region described infra. The MTS locus is intended to include coding sequences, intervening 
sequences and regulatory elements controlling transcription and/or translation. The MTS locus is 

intended to include all allelic variations of the DMA <*qnenCft, 

These terms, when applied to a nucleic acid, refer to a nucleic acid which encodes a MTS 
25 polypeptide (including pi 6), fragment, homolog or variant, including, eg, protein fusions or 
deletions. The nucleic adds of the present invention will possess a sequence which is either 
derived from, or substantiaJly_similar_to a natural MTS-encoding gene or one having substantial 
homology with a natural MTS-encoding gene or a portion thereol The coding sequence for an 
MTS polypeptide (MTS1) is shown in SEQ ID NO:l, and the amino acid sequence of an MTS 
30 polypeptide (MTS1) is shown in SEQ CD N02. The coding sequence for a second MTS 
polypeptide (MTS1EIP) is shown in SEQ ID NO:13, and the corresponding amino add se^ugge 
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is shown in SEQ ID NO: 14. The coding sequence for a third MTS polypeptide (MTS2) is shown 
in SEQ ID NO:15,-and the corresponding amino acid sequence is shown in SEQ ID NO:16. The 
term P16 is used interchangeably with MTS1 and MTS1E10 and is used to mean both MTS1 
which encodes a p!6 and MTSlElp which encodes a plO. MTS1 and MTSlElp are two forms of 
one gene, the tow forms being dependent upon which of two promoters is used for transcription. 
MTS2 is a separate portion of the MTS region and it encodes a pl5. 

The polynucleotide compositions of this invention include RNA, cDNA, genomic DNA, 
synthetic forms, and mixed polymers, both sense and antisense strands, and may be chemically or 
biochemica l ly modified or may contain non-natural or derivatized nucleotide bases, as will be 
readily appreciated by those skilled in the art Such modifications inrhHf, for example, labels, 
methylation, substitution of one or more of the naturally occurring nucleotides with an a nalo g, 
internudeotide modifications such as uncharged linkages (eg*, methyl phosphorates, 
phosphotriestera, phosphoamidatea, carbamates, etc), charged linkages (e.&, phosphorothioates, 
phosphorodhhioates, etc), pendent moieties (e.&, polypeptides), intercalators (e.g., acridine, 
psoralen, etc.), chelators, alkylates, and modified linkages (eg., alpha anomeric nucleic acids, 
etc.). Also included are synthetic molecules that mimic polynucleotides in their ability to bind to a 
designated sequence via hydrogen bonding and other chemical interactions. Such molecules are 
known in the art and include, for example, those in which peptide linkages substitute for phosphate 
linkages in the backbone of the molecule. 

The present invention provides recombinant nucleic acids comprising all or part of the MTS 
region. Hie recombinant construct may be capable of replicating autonomously in a host cell 
Alternatively, the recombinant construct may become integrated into the chromosomal DNA of the 
host celL Such a recombinant polynucleotide comprises a polynucleotide of genomic, cDNA, 
semi-synthetic, or synthetic origin which, by virtue of its origin or manipulation, 1) is not associa- 
ted with all or a portion of a polynucleotide with which it is associated in nature; 2) is linked to a 
polynucleotide other than that to which it is linked in nature; or 3) does not occur in nature. 

Therefore, recombinant- nucleic acids comprising sequences otherwise not naturally 
occurring are provided by this invention. Although the wild-type sequence may be employed, it 
will often be altered, e.g., by deletion, substitution or insertion. 

cDNA or genomic libraries of various types may be s cree n ed as natural sources of the 
nucleic acids of -the present invention, or such nucleic acids may be provided -by amplific^oa^f 
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sequcnces resident in genomic DNA or other natural sources, e.g., by PCR. The choice of cDNA 
libraries normally corresponds to a tissue source which is abundant in mRNA for the desired 
proteins. Phage libraries are normally preferred, but other types of libraries may be used. Clones of 
a library are spread onto plates, transferred to a substrate for screening, denatured and probed for 
5 the presence of desired sequences. 

The DNA sequences used in this invention will usually comprise at least about five codons 
(IS nucleotides), more usually at least about 7-15 codons, and most preferably, at least about 35 
codons. One or more introns may also be present. This number of nucleotides is usually about the 
minimal length required for a successful probe that would hybridize specifically with a MTS- 

10 encoding sequence. 

Techniques for nucleic acid manipulation are described generally, for example, in Sambrook 
et aL, 1989 or Ausubel et al., 1992. Reagents useful in applying such techniques, such as restriction 
enzymes and the like, are widely known in the art and commercially available from such vendors 
as New England BioLabs, Boefaringer Mannheim, Amersham, Promega Biotec, U. S. 

15 Biochemicals, New England Nuclear, and a number of other sources. The recombinant nucleic 
acid sequences used to produce fusion proteins of die present invention may be derived from 
natural or synthetic sequences. Many natural gene sequences are obtainable from various cDNA or 
from genomic libraries using appropriate probes. See, GenBank, National Institutes of Health. 
"MTS Region" refers to a portion of human chnnrosome 9 found in the PI clones PI- 1062 

20 and PI -1 063. These PI clones, in £ coll NS3529, were deposited with the American Type Culture 
Collection, Rockville, Maryland USA on 16 March 1994 and assigned ATCC Nos. 69589 and 
69590, respectively. This region contains the MTS locus, including the MTS1, MTS2 and 
MTSlElp genes. 

As used herein, the terms "MTS locus," "MTS allele" and "MTS region" all refer to the 
25 double-stranded DNA comprising the locus, allele, or region, as well as either of the single- 
stranded DNAs comprising the locus, allele or region. 

As used herein, a "portion" of the MTS locus or region or allele is defined as having a 
minimal size of at least about eight nucleotides, or preferably about 15 nucleotides, or more 
preferably at least about 25 nucleotides, and may have a minimal size of at least about 40 
30 nucleotides. - 
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"MTS protein" or "MTS polypeptide 91 refer to a protein or polypeptide encoded by the 
MTS locus (including MTS1 polypeptide, MTS2 polypeptide and MTS1E10 polypeptide), variants 
or fragments thereof Hie torn "polypeptide* refers to a polymer of amino acids and its equivalent 
and does not refer to a specific length of the product; thus, peptides, oligopeptides and proteins art 
5 included within the definition of a polypeptide. This term also does not refer to, or exclude 
modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations, and 
the like. Included within the definition are, for example, polypeptides containing one or more 
analogs of an amino acid (including, for example, unnatural amino acids, etc), polypeptides with 
'substituted linkages as well as other modifications known in the art, both naturally and non- 
10 naturally occurring. Ordinarily, such polypeptides will be at least about 50% homologous to the 
native MTS sequence, preferably in excess of about 90% and more preferably at least about 95Y% 
homologous. Also included ait proteins encoded by DNA which hybridize under high or low 
stringency conditions, to MTS-encoding nucleic acids and closely related polypeptides or proteins 
retrieved by antisen to the MTS proteins). 
15 The length of polypeptide sequences compared for homology will generally be at least about 

1 6 amino acids, usually at least about 20 residues, more usually at least about 24 residues, typically 
at least about 28 residues, and preferably more than about 35 residues. 

"Optrably linked 19 refers to a juxtaposition wherein the components so described are in a 
relationship permitting them to function in their intended manner. For instance, promoter is 
20 operably linked to a coding sequence if the promoter affects its transcription or e xpres sion. 

"Probes' 9 . Polynucleotide polymorphisms associated with MTS alleles which predispose to 
certain cancers or are associated with most cancers are d et ect e d by hybridization with a 
polynucleotide probe which forms a stable hybrid with that of the target sequence, under stringent 
to moderately stringent hybridization and wash conditions. If it is expected that the probes will be 
25 perfectly complementary to die target sequence, stringent conditions will be used. Hybridization 
stringency may be lessened if some mismatching is expected, for example, if variants are expected 
with the result that the probe will not be completely complementary. Conditions are chosen which 
rule out nonspecific/adventitious bindings, that is, which minimize noise. Since such indications 
identify neutral DNA polymorphisms as well as mutations, these indications need further analysis 
30 to demonstrate detection of a MTS susceptibility allele. ^ M . . . 
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Probes for MTS alleles may be derived from the sequences of the MTS region or its cDNAs. 
The probes may be of any suitable length, which span all or a portion of the MTS region, and 
which allow specific hybridization to the MTS region. If the target sequence contains a sequence 
identical to that of the probe, the probes may be short, e.g., in the range of about 8-30 base pairs, 
5 since the hybrid will be relatively stable under even stringent conditions. If some degree of 
mismatch is expected with the probe, Lc, if it is suspected that the probe will hybridize to a variant 
region, a longer probe may be employed which hybridizes to the target sequence with the requisite 
specificity. 

The probes will include an isolated polynucleotide attached to a label or reporter molecule 

10 and may be used to isolate other polynucleotide sequences, having sequence similarity by standard 
methods. For techniques for preparing and labeling probes see, e.g., Sambrook et ah, 1989 or 
Ausubel et aL, 1991 Other similar polynucleotides may be selected by using homologous 
polynucleotides. Alternatively, polynucleotides encoding these or similar polypeptides may be 
synthesized or selected by use of the redundancy in the genetic code. Various codon substitutions 

15 may be introduced, e.g^ by silent changes (thereby producing various restriction sites) or to 
optimize expression for a particular system. Mutations may be introduced to modify the properties 
of the polypeptide, perhaps to change ligand-binding affinities, interchain affinities, or the 
polypeptide degradation or turnover rate. 

Probes comprising synthetic oligonucleotides or other polynucleotides of the present 

20 invention may be derived from naturally occurring or recombinant single- or double-stranded 
polynucleotides, or be chemically synthesized. Probes may also be labeled by nick translation, 
Klenow fill-in reaction, or other methods known in the art 

Portions of the polynucleotide sequence having at least about eight nucleotides, usually at 
least about IS nucleotides, and fewer than about 6 Kb, usually fewer than about 1.0 Kb, from a 

25 polynucleotide sequence encoding MTS are preferred as probes. The probes may also be used to 
determine whether mRNA encoding MTS is present in a cell or tissue. 

"Protein modifications or fragments 91 are provided by the present invention for MTS poly- 
peptides or fragments thereof which are substantially homologous to primary structural sequence 
but which include, eg,, in vivo or in vitro chemical and biochemical modifications or which 

30 incorporate unusual amino acids. Such modifications inchide^ for example,, acetylation, 
carboxylation^ phosphorylation, glycosylation, ubiquitination, labeling, e.g., #ith radion uclide s. 
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and various enzymatic modifications, as will be readily appreciated by those well skilled in the ait 
A variety of methods for labeling polypeptides and of substituents or labels useful for such 
purposes axe well known in the at, and include radioactive isotopes such as S *P, ligands, which 
bind to labeled antiligands (e.g., antibodies), fluorophores, chemiluminescent agents, enzymes, and 
antiligands which can serve as specific binding pah members for a labeled iigand. The choice of 
label depends on the sensitivity required, ease of conjugation with the prima, stability 
requirements, and available instrumentation. Methods of labeling polypeptides are well known in 
the art See, e,g.,SambrooketaL, 1989 or AusubeletaL, 1991 

Besides substantially foil-length polypeptides, the present invention provides for biologically 
active fragments of the polypeptides. Significant biological activities include ligand-binding, 
' immunological activity and other biological activities characteristic of MTS polypeptides. 
Immunological activities include both immunogenic function in a target immune system, as well as 
sharing of i mmu nological epitopes for binding, serving as either a competitor or substitute antigen 
for an epitope of the NfTS protein. As used herein, "epitope" refers to an antigenic determinant of a 
polypeptide* An epitope could comprise three amino acids in a spatial conformation which is 
unique to the epitope. Generally, an epitope consists of at least five such amino acids, and more 
usually consists of at least 8*10 such amino adds. Methods of determining the spatial 
conformation of such amino acids are known in the art 

F. r immunological purposes, tandem-repeat polypeptide segments may be used as 
immunogens, thereby producing highly antigenic proteins. Alternatively, such polypeptides will 
serve as highly efficient competitors for specific binding. Production of antibodies specific for 
MTS polypeptides or fragments thereof is described below. 

The present invention also provides for fusion polypeptides, comprising MTS polypeptides 
and fragments. Homologous polypeptides may be foaons between two or more MTS polypeptide 
sequences or between the sequences of MTS and a related protein. Likewise, heterologous fusions 
may be constructed which would exhibit a combination of properties or activities of the derivative 
proteins. For example, ligand-binding or other domains may be "swapped" between different new 
fusion polypeptides or fragments. Such homologous or heterologous fusion polypeptides may 
display, for example, altered strength or specificity of binding. Fusion partners include 
immunoglobulins, bacterial p-galactosidase, txpE, protein A, (^lactamase, alpha amylase; alcohol 
dehydrogenase" and yeast alpha mating factor. See, c.g., Godowski et aL, 1988. — — . 
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Fusion proteins wUl typically be made by either recombinant nucleic acid methods, as 
described below, or may be chemically synthesized. Techniques for the synthesis of polypeptides 
are described, for example, in Merrifield, 1963. 

"Protein purification" refers to various methods for the isolation of the MTS polypeptides 
from other biological material, such as from cells transformed with recombinant nucleic adds 
encoding MTS, and are well known in the art For example, such polypeptides may be purified by 
immunoaffinity chromatography employing, e.g, the antibodies provided by the present invention. 
Various methods of protein purification are well known in the art, and include those described in 

Deutscher, 1990 and Scopes, 198Z 

The terms "isolated", "substantially pure", and "substantially homogeneous" are used 

interchangeably to describe a protein or polypeptide which has been separated from components 
which accompany it in its natural state. . A monomeric protein is substantially puns when at least 
about 60 to 75% of a sample exhibits a single polypeptide sequence. A substantially pure protein 
will typically comprise about 60 to 90% WAV of a protein sample, more usually about 95%, and 
preferably wffl be over about 99% pure. Protein purity or homogeneity may be indicated by a 
number of means well known in the art, such as polyaoylamide gel electrophoresis or a protein 
sample, followed by visualizing a single polypeptide band upon staining the gel For certain 
purposes, higher resolution may be provided by using HPLC or other means well known in the art 

for purification utilized. 

A MTS protein is substantially free of naturally associated components when it is separated 
from the native contaminants which accompany It in hs natural state. Thus, a polypeptide which is 
chemically synthesized or synthesized in a cellular system different from the cell from which it 
naturally originates will be substantially free from its naturally associated components. A protein 
may also be rendered substantially free of naturally associated components by isolation, using 
protein purification techniques well known in the art 

A polypeptide produced as an expression product of an isolated and manipulated genetic 
sequence is an "isolated polypeptide^as used henrim even if expressed m a to cell type- 

Synthetically made forms or molecules expressed by heterologous cells are inherently isolated 
. molecules. 

"Recombinant nucleic add" is a nucleic acid which is not naturally o<xurring r or which is 
made by the-artifidal combination of two otherwise separated segmenbk of sequenc£_Tbis 
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artificial combination is often accomplished by either chemical'synthesis means, or by the artificial 
manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques. Such 
is usually done to replace a codon with a redundant codon encoding the same or a conservative 
flminn acid, while typically introducing or removing a sequence recognition she. Alternatively, it 
is perforated to join together nucleic add segments of desired functions to generate a desired 
combination of functions. 

"Regulatory sequences" refers to those sequences normally within 10 Kb of the coding 
region of a locus which affect the expression of the gene (including transcription of the gene, and 
translation, splicing, stability or the like of the messenger RNA). 

"Substantial homology or similarity". A nucleic add or fragment thereof is "substantially 
homologous" ("or substantially similar") to another if, when optimally aligned (with appropriate 
nucleotide insertions or deletions) with the other nucleic acid (or its complementary strand), there 
is nucleotide sequence identity in at least about 60% of the nucleotide bases, usually at least about 
70%, more usually at least about 80%, preferably at least about 90%, and more preferably at least 
about 95-98% of die nucleotide bases. 

Alternatively, substantial homology or (similarity) exists when a nucleic acid or fragment 
thereof will hybridize to another nucleic acid (or a complementary strand thereof) under selective 
hybri dizati on conditions, to a strand, or to its complement Selectivity of hybridization exists when 
hybridization which is substantially more selective than tot J lack of specificity occurs. Typically, 
selective hybridization will occur when there is at least about 55% homology over a stretch of at 
least about 14 nucleotides, preferably at least about 65%, more preferably at least about 75%, and 
most preferably at least about 90% See, Kanebisa, 1984. The length of homology comparison, as 
described, may be over longer stretches, and in certain embodiments will often be over a stretch of 
at least about nine nucleotides, usually at least about 20 nucleotides, more usually at least about 24 
nucleotides, typically at least about 28 nucleotides, more typically at least about 32 nucleotides, 
and preferably at least about 36 or more nucleotides. 

Nucleic add hybridization will be affected by such conditions as salt concentration, 
temperature, or organic solvents, in addition to the base composition, length of the complementary 
strands, and the number of nucleotide base mismatches between the hybridizing nucleic adds, as 
will be readily appreciated by those skilled in the art Stringent temperature conditions will 
generally include temperatures in excess of 30°C, typically in excess of 37°C anipTefer^yJn 
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excess of 45°C. Stringent salt conditions will ordinarily be less than 1000 mM, typically less than 
500 mM, and preferably less than 200 mM. However, the combination of parameters is much more 
important than the measure of any single parameter. See, e.g., Wetmur & Davidson, 1968. 

Probe sequences may also hybridize specifically to duplex DNA under certain conditions to 
form triplex or other higher order DNA complexes. The preparation of such probes and suitable 
hybridization conditions are well known in the ait 

The terms "substantial homology" or "substantial identity", when referring to 
polypeptides, indicate that the polypeptide or protein in question exhibits at least about 30% 
' identity with an entire naturally-occurring protein or a portion thereof usually at least about 7096 
identity, and preferably at least about 95% identity. 

"Substantially similar function" refers to the function of a modified nucleic acid or a 
modified protein, with reference to the wild-type MTS nucleic add or wild-type MTS polypeptide. 
The modified polypeptide will be substantially homologous to the wild-type MTS polypeptide and 
will have substantially the same function, Le^.the inhibition of Cdks, especially Cdk4. The 
modified polypeptide may have an altered amino acid sequence and/or may contain modified 
amino acids. In addition to the function of inhibiting Cdks, the modified polypeptide may have 
other useful properties, such as a longer half-life. Hie Cdk-inhibitory activity of the modified 
polypeptide may be substantially the same as the activity of the wild-type MTS polypeptide. 
Alternatively, the Cdk-inhibitoiy activity of the modified polypeptide may be hif 'wr than the 
activity of the wild-type MTS polypeptide. The modified polypeptide is synthesized using 
conventional techniques, or is encoded by a modified nucleic add and produced using 
conventional techni ques. The modified nucleic acid is prepared by conventional techniques. A 
nucleic acid with a function substantially similar to the wild-type MTS gene function produces the 
modified protein described above. 

Homology, for polypeptides, is typically measured using sequence analysis software. See, 
e.g., the Sequence Analysis Software Package of the Genetics Computer Group, University of 
Wisconsin Biotechnology Center, 910 University Avenue, Madison, Wisconsin S370S. Protein 
analysis software matches similar sequences using measure of homology assigned to various 
substitutions, deletions, substitutions, and other modifications. Conservative substitutions typically 
include substitutions within the following groups: glycine, alanine; valine, isoleujCjfcfc* leucine; 
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asparric acid, glutamic acid; asparagine, glutaraine; serine, threonine; lysine, arginine; and 
phenylalanine, tyrosine. 

A polypeptide "fragment," "portion" or "segment" is a stretch of amino acid residues of at 
least about five to seven contiguous amino adds, often at least about seven to nine contiguous 
amino acids, typically at least about nine to 13 contiguous amino acids and, most preferably, at 
least about 20 to 30 or more contiguous amino adds. 

The polypeptides of the present invention, if soluble, may be coupled to a solid-phase 
support, nitrocellulose, nylon, column packing materials (eg,, Sephaxose beads), magnetic 
beads, glass wool, plastic, metal, polymer gels, cells, or other substrates. Such supports may take 
the form, for example, of beads, wells, dipsticks, or membranes. 

"Target region 19 refers to a region of the nucleic add which is amplified and/or detected 
The term "target sequence" refers to a sequence with which a probe or primer will form a stable 
hybrid under desired conditions. 

The practice of the presen t invention employs, unless otherwise indicated, conventional 
techniques of chemistry, molecular biology, microbiology, recombinant DNA, genetics, and 
immunology. See, e.g., Maniatis et aL, 1982; Sambrook et aL, 1989; Ausubel et aL, 1992; Glover, 
1985; Anand, 1992; Guthrie & Fink, 1991. A general discussion of techniques and materials for 
human gene mapping, including mapping of human chromosome 9p, is provided, e.g., in White & 
Laloud, 1988. 

Preparation of recombinant or chemically svnth«imd 
nucleic acid* vector* tran«fnmMtmfi hncr 

Large amounts of the polynucleotides of the present invention may be produced by 
replication in a suitable host celL Natural or synthetic polynucleotide fragments coding for a 
desired fragment will be incorporated into recombinant polynucleotide constructs, usually DNA 
constructs, capable of introduction into and replication in a prokaryotic or e ukar yotic cell. Usually 
the polynucleotide constructs win be suitable for replication in a unicellular host, such as yeast or 
bacteria, but may also be intended for introduction to (with and without integration within the 
genome) cultured mammalian or plant or other eukaryotic cell lines. The purification of nucleic 
acids produced by the methods of the present invention are described, e.g., in Sambrook et aL, 
1989 or Ausubel etaL, 1992. 



WO 95/25813 



PCTOJS95/03537 



-32- 

The polynucleotides of the present invention may also be produced by chemical synthesis, 
e.g., by the phospboramidite method described by Beaucage & Carrutbers, 1981 or the triester 
method according to Matteucci et aL, 1981, and may be performed on commercial, automated 
oligonucleotide synthesizers. A double-stranded fragment may be obtained from the single- 
stranded product of chemical synthesis either by synthesizing the complementary strand and 
annealing the strands together under appropriate conditions or by adding the complementary strand 
using DNA polymerase with an appropriate primer sequence. 

Polynucleotide constructs prepared for introduction into a prokaryotic or eukaryotic host 
may comprise a replication system recognized by the host, including the intended polynucleotide 
fragment encoding the desired polypeptide, and will preferably also include transcription and 
t ranslati onal initiation regulatory sequence* operably linked to the polypeptide encoding segment 
Expression vectors may include, for example, an origin of replication or autonomously replicating 
sequence (ARS) and expression control sequences, a promoter, an enhancer and necessary 
processing information sites, such as ribosome-binding sites, RNA splice sites, polyadenylation 
sites, transcriptional terminator sequences, and mRNA stabilizing sequences. Secretion signals 
may also be included where appropriate, whether from a native MTS protein or from other 
receptors or from secreted polypeptides of the same or related species, which allow the protein to 
cross and/or lodge in cell membranes, and thus attain its functional topology, or be secreted from 
the cc ,1. Such vectors may be prepared by means of standard recombinant techniques well known 
in the art and discussed, for example, in Sambrook et aL, 1 989 or Ausubel et aL 1 992. 

The selection of an appropriate promoter and other necessary vector sequences will be 
selected so as to be functional in the host, and may include^ when appropriate, those naturally 
associated with MTS genes. Examples of workable combinations of cell lines and expression 
vectors are described in Sambrook et aL, 1 989 or Ausubel et aL, 1 992; see also, e.g„ Metzgcr et aL, 
1988. Many useful vectors are known in the art and may be obtained from such vendors as 
Stratagene, New England Biolabs, Promega Biotech, and others. Promoters such as the trp, lac and 
phage promoters, tRNA promoters and glycolytic enzyme promoters may be used in prokaryotic 
hosts. Useful yeast promoters include promoter regions for metallothionein, 3-phosphoglycerate 
kinase or other glycolytic enzymes such as enolase or glyceraldehyde-3-phosphale dehydrogenase, 
enzymes responsible for maltose and galactose utilization, and others. Vectors and -promoters 
suitable for "me in yeast expression are further described in Hitzeman et aL,-EP 23^675 A. 
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Appropriate non-native mammalian promoters might include the early and late promoters from 
SV40 (Tiers et aL, 1978) or promoters derived from murine molony leukemia virus, mouse tumor 
vims, avian sarcoma viruses, adenovirus C, bovine papilloma virus or polyoma. In addition, the 
construct may be joined to an amplifiable gene (e.g. DHFR) so that multiple copies of the gene 
may be made. For appropriate enhancer and other expression control sequences, see also 
Enhancers and Eukaryotic Gene Expression, Cold Spring Harbor Press, Cold Spring Harbor, New 
York (1983). 

While such expression vectors may replicate autonomously, they may also replicate by being 
inserted into the genome of the host by methods weU known in the art 

Expression and cloning vectors will likely contain a selectable marker, a gene encoding a 
protem necessary for sumval OTgro^ The presence of 

mis gene ensures growth of only those host cells which express the inserts. Typical selection genes 
encode proteins that a) confer resistance to antibiotics or other toxic substances, e.g. ampicillin, 
neomycin, methotrexate, etC4 b) complement, auxotrophic deficiencies, or c) supply critical 
nutrients not available from complex media, e.g. the gene encoding D-alamne racemase for 
Bacilli. The choice of the proper selectable marker will depend on the host cell, and appropriate 
markers for different hosts are well known in the art 

The vectors containing the nucleic adds of interest can be transcribed to vitro, and the 
resulting RNA introduced into he host cell by well-known methods, eg., by injection (see, T. 
Kubo et aL, 1988), or u« vectors can be mtrodu^ 

in the art, which vary depending on the type of cellular host, including electroporation; transfection 
employing calcium chloride, rubidium chloride, calcium phosphate, DEAE-dextran, or other 
substances; microprojectile bombardment; lipofection; infection (where the vector is an infectious 
agent, such as a retroviral genome); and other methods. See generally, Sambrook et aL, 1989 and 
Ausubel et aL, 1992. The introduction of the polynucleotides into the host cell by any method 
known in the art, including, Inter alia, those described above, wffl be referred to herein as 
"transformatioa" The cells into which have been introduced nucleic acids described above are 
meant to also include the progeny of such cells. 

Large quantities of the nucleic adds and polypeptides of the present invention may be 
prepared by expressing the MTS nucleic adds or portions thereof in vectors or other-expression 
vehicles in compatible prokaryotic or eukaryotic host cells. The most commonly used prokfgotic 
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hosts are strains of Escherichia coll, although other prokaryotes, such as Bacillus subtilis or 

Pseudomonas may also be used. 

Mammalian or other eukaryotic host cells, such as those of yeast, filamentous fungi, plant, 
insect, or amphibian or avian species, may also be useful for production of the proteins of the 
present invention. Propagation of mammalian cells in culture is per se well known. See, Jakoby 
and Pastan (eds.), 1979. Examples of commonly used mammalian host cell lines are VERO and 
HeLa cells, Chinese hamster ovary (CHO) cells, and WI38, BHK, and COS cell lines, although it 
will be appreciated by the skilled practitioner that other cell lines may be appropriate, e.g., to 
provide higher expression, desirable glycosylation patterns, or other features. 

Clones are selected by using markers depending on the mode of the vector construction. The 
marker may be on the same or a different DNA molecule, preferably the same DNA molecule In 
prokaiyotic hosts, the transformant may be selected, e.g, by resistance to arnpicillin, tetracycline or 
other antibiotics. Production of a particular product based on temperature sensitivity may also 

serve as an appropriate marker. 

Prokaryotic or eukaryotic cells transformed with the polynucleotides of the present invention 
will be useful not only for the production of the nucleic acids and polypeptides of the present 
invention, but also, for example, in studying the characteristics ofMTS polypeptides. 

Antisense polynucleotide sequences are useful in preventing or diminishing the expression of 
the MTS locus, as will be appreciated by those skilled *n the art. For example, polynucleotide 
vectors containing all or a portion of the MTS locus or other sequences from the MTS region 
(particularly those flanking the MTS locus) may be placed under die control of a promoter in an 
antisense orientation and introduced into a celL Expression of such an antisense construct within a 
cell will interfere with MTS transcription and/or translation and/or replication. 

Cycline and Cdks are ubiquitous cell-cycle control elements in eukaryotes. Such proteins 
were initially discovered in yeast, and have been found in marine invertebrates, amphibians and 
mammals, including mouse, rabbit and humans. Homologous cell-cycle control genes are 
identified in other species by using probes and/or primers based on the gene sequence in one 
species. Thus, probes and primers based on the MTS gene sequences disclosed herein are used to 
identify homologous MTS gene sequences and proteins in other species. These MTS gene 
sequences and proteins are used in the diagnostic/prognostic, therapeutic and drug- screening 
methods described herein for the species from which they have been isolated.- — ^ 
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MflfafldS QfUae; Miclric Acid Diagnosis and Diagnostic Kits 

In order to detect the presence of a MTS allele predisposing an individual to cancer, a 
biological sample such as blood is prepared and analyzed for the presence or absence of 
5 susceptibility alleles of MTS. In order to detect the presence of neoplasia, the progression toward 
malignancy of a precursor lesion, or as a prognostic indicator, a biological sample of the lesion is 
prepared and analyzed for the presence or absence of neoplastic alleles of MTS. Results of these 
tests and interpretive information are returned to the health care provider for communication to the 
tested individual. Such diagnoses may be performed by diagnostic laboratories, or, alternatively, 
10 diagnostic kits are manufactured and sold to health care providers or to private individuals for self* 
diagnosis. 

Initially, the screening method involves amplification of the relevant MTS sequences, eg., 
by PCR, followed by DN A sequence analysis* In another preferred embodiment of the invention, 
the screening method involves a non-PCR based strategy. Such screening methods include two- 

15 step label amplification methodologies that are well known in die art Both PCR and non-PCR 
based screening strategies can detect target sequences with a high level of sensitivity. 

The most popular method used today is target amplification. Here, the target nucleic acid 
sequence is amplified with polymerases. One particularly prefe rre d method using polymerase 
driven amplification is the polymerase chain reaction (PCR). A preferred PCR uased strategy 

20 contemplated within die scope of this invention is provided in Example 1. The polymerase chain 
reaction and other polymerase-driven amplification assays can achieve over a million-fold increase 
in copy number through the use of polymerase-driven amplification cycles. Once amplified, the 
resulting nucleic acid can be sequenced or used as a substrate for DNA probes. 

When the probes are used to detect die presenc e of die target sequences (for example, in 

25 screening for cancer susceptibility), the biological sample to be analyzed, such as blood or serum, 
may be treated, if desired, to extract the nucleic acids. The sample nucleic acid may be prepared in 
various ways to facilitate detection of the target sequence; eg. denaturction, restriction digestion, 
electrophoresis or dot blotting. The targeted region of the analyte nucleic add usually must be at 
least partially single-stranded to form hybrids with the targeting sequence of the probe. If the 

30 sequence is naturally single-stranded, denaturatioa will not be required. However, ifftelsequence 
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is double-stranded, the sequence will probably need to be denatured. Denaturation can be carried 
out by various techniques known in the art 

Analyte nucleic add and probe are incubated under conditions which promote stable hybrid 
formation of the target sequence in the probe with the putative targeted sequence in the analyte. 
The region of die probes which is used to bind to the analyte can be made completely 
complementary to the targeted region of human chromosome 9p. Therefore, high stringency 
conditions are desirable in order to prevent false positives. However, conditions of high stringency 
are used only if the probes are complementary to regions of the chromosome which are unique in 
the genome. The stringency of hybridization is determined by a number of factors during 
hybridization and during the washing procedure, including temperature, ionic strength, base 
composition, probe length, and concentration of fbrmamide. These factors are outlined in, for 
example, Maniatis et aL, 1982 and Sambrook et aL, 19891 Under certain circumstances, the 
formation of higher order hybrids, such as triplexes, quadraplexes, etc., may be desired to provide 
the means of detecting target sequences. 

Detection, if any, of the resulting hybrid is usually accomplished by the use of labeled 
probes. Alternatively, the probe may be unlabeled, but may be detectable by specific binding with 
a ligand which is labeled, either directly or indirectly. Suitable labels, and methods for labeling 
probes and Uganda are known in the art, and include, for example, radioactive labels which may be 
incorporated by known methods (e.g^ nick translation, random priming or kinasing), biotin, 
fluorescent groups. chemUununescent groups (e*, dioxetanes, particularly triggered dioxetanes), 
enzymes, antibodies and the like. Variations of tins basic scheme are known in the art, and include 
those variations that facilitate separation of the hybrids to be detected from extraneous materials 
and/or that amplify foe signal from the labeled moiety. A number of these variations are reviewed 
in, eg, Matthews & Kricka, 1988; Landegren et aL, 1988; MMin, 1989; U.S. Patent 4,868,105, 
and in EPO Publication No. 225,807. 

As noted above, non-PCR based screening assays are also contemplated in this invention. 
An exemplary non-PCR based procedure is provided in Example 15. This procedure hybridizes a 
nucleic acid probe (or an analog such as a methyl phosphonate backbone replacing the normal 
phosphodiester), to the low level DNA target This probe may have an enzyme covalently linked 
to the probe, such that the covalent linkage does not interfere with the specificity bT the hybrid- 
ization. This enzymeirobe<onjugate.target nucleic acid complex can then be-isolated swa^fem. ..... 
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the free probe enzyme conjugate and a substrate is added for enzyme detection. Enzymatic activity 
is observed as a change in color development or luminescent output resulting in a lOMo' increase 
in sensitivity. For an example relating to the preparation of oligodeoxynucleotide-aflcaline 
phosphatase conjugates and their use as hybridization probes see Jablonski et aL 9 1986. 
5 Two-step label amplification methodologies are known in the art These assays work on 

the principle that a small ligand (such as digoxigenin, biotin, or the like) is attached to a nucleic 
acid probe capable of specifically binding MTS. An exemplary probe for MTS1 is the nucleic acid 
probe corresponding to nucleotide positions 448 to 498 of SEQ ID NO:4. Allele specific probes 
are also contemplated within the scope of tins example and exemplary allele specific probes 

10 include probes encompassing the predisposing mutations summarized in Table 3 and the somatic 
mutations in tumors summarized in Table 5. 

In one example, the small ligand attached to the nucleic add probe is specifically recog- 
nized by an antibody-enzyme conjugate. In one embodiment of this example, digoxigenin. is 
attached to the nucleic acid probe. Hybridization is detected by an antibody-alkaline phosp hatase 

IS conjugate which turns over a chemihuninescent substrate. For methods for labeling nucleic acid 
probes according to this embodiment see Martin et aL 9 1990. In a second example, the small 
ligand is recognized by a second ligand-enzyme conjugate that is capable of specifically 
complexing to the first ligand A well known embodiment of this example is the biotin-evidin type 
of inter a ctions. For methods for labeling nucleic acid probes and their use in biotin-avidin based 

2 0 assays see Rigby, etah, 1977 and Nguyen, et al. ( 1 992). 

It is also contemplated within die scope of this invention that the nucleic acid probe assays 
of this invention will employ a cocktail of nucleic acid probes capable of detecting MTS genes. 
Thus, in one example to detect the presence of MTS1 in a cell sample, more than one . probe 
complementary to MTS1 is employed and in particular the number of different probes is 

25 alternatively 2, 3, or 5 di fferent nucleic acid probe *f*["*ncgs In another example, to detect the 
presence of mutations in the MTS I gene wpmrcg in a patient, more than one probe comple- 
mentary to MTS1 is employed where the cocktail includes probes capable of binding to the allele- 
specific mutations identified in populations of patients with alterations in MTS1. In this 
embodiment, any number of probes can be used, and will preferably include probes corresponding 

30 to the major gene mutations identified as predisposing an individual to breast cancer? Some 
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candidate probes contemplated within the scope of the invention include probes that include the 
allele-specific mutations identified in Tables 3 and 5. 

Mgthnrte of U «g- Pqrti A» DiRgro^a and Diagnostic Kit* 
5 The neoplastic condition of lesions can also be detected on the basis of the alteration of wild- 

type MTS polypeptide. Such attentions can be determined by sequence analysis in accordance 
with conventional trrhnignrs More preferably, antibodies (polyclonal or monoclonal) are used to 
detect differences in, or the absence of MTS peptide* In a preferred embodiment of the invention, 
antibodies will immunopredphate MTS proteins from solution as well as react with MTS protein 

10 on Western or immunoblots of polyacrylaroide gels. In another preferred embodiment, antibodies 
will detect MTS proteins in paraffin or frozen tissue sections, using immunocytochemical 
techniques Techniques for raising and purifying antibodies are well known in the art and «ny 
such techniques may be chosen to achieve the preparation of the invention. 

P iefeir e d embodiments relating to methods for detecting MTS or its mutations include 

15 enzyme linked immunosorbent assays (ELISAX radioimmunoassays (RIA), immunoradiometric 
assays (IRMA) and immunoenzymatic assays (IEMA), including sandwich assays using 
monoclonal and/or polyclonal antibodies. Exemplary sandwich assays are described by David et 
aL, in US Patent Nos. 4376,1 10 and 4,486,330, hereby incorporated by refe rence, and exemplified 
in Example 18. 

20 

Methods of Use; Drug Screening 

Hie present invention is particularly useful for screening compounds by using the Cdk 

polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 

Preferably, Cdk4 is utilized The Cdk polypeptide or fragment employed in such a test may either 
25 be free in solution, affixed to a solid support, or borne on a cell surface* One method of drug 

screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with 

recombinant polynucleotides expressing the polypeptide or fragment, preferably in competitive 

binding assays. Such cells, either in viable or fixed form, can be used for standard binding assays. 

One may measure, for example, for the formation of complexes between a Cdk polypeptide or 
30 fragment and the agent being tested, or examine die degree to which the formation of a- complex 
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betwecn a Cdk polypeptide or fragment and MTS polypeptide or fragment is interfered with by the 
agent being tested,' 

Thus, the present invention provides methods of screening for drugs comprising contacting 
such an agent with a Cdk polypeptide or fragment thereof and assaying: 1) for the presence of a 
complex between the agent and the Cdk polypeptide or fragment, or 2) for the presence of a 
complex between the Cdk polypeptide or fragment and a ligand, by methods well known in the art 
The activity of Cdk is also measured to determine if die agent is capable of inhibiting Cdk, and 
hence capable of regulating the cell cycle. In such competitive binding assays the Cdk polypeptide 
'or fragment is typically labeled. Free Cdk polypeptide or fragment is separated from that present 
in a protem:protein complex, and the amount of free (Le, uncompleted) label is a measure of the 
b inding of the agent being tested to Cdk or hs interference with CdfcMTS polypeptide binding, 
respectively. Small peptides of MIS polypeptide (peptide mimed cs) are analyzed in this manner 
to identify those which have Cdk inhibitory activity. 

Another technique for drug screening provides high throughput screening for compounds 
having suitable binding affinity to the Cdk polypeptides and is described in detail in Geysen, 
European Patent Application No. 84/03664, published 13 September 1984. Briefly stated, large 
numbers of different small peptide test compounds are synthesized on a solid substrate, such as 
plastic pins or some other surface. The peptide test compounds are reacted with Cdk polypeptide 
and washed Bound Cdk polypeptide is then detected by ; icthods well known in the art. 

Purified Cdk can be coaled directly onto plates for use in the aforementioned drug screening 
♦'"• hm'T't However, non-neutralizing antibodies to the polypeptide can be used to capture 
antibodies to immobilize the Cdk polypeptide on the solid phase. 

The present invention also contemplates the use of competitive drug screening assays, in 
which neutralizing antibodies capable of specifically binding the Cdk polypeptide compete with a 
test compound for binding to the Cdk polypeptide or fragments thereof. In tins manner, the 
antibodies can be used to detect the presence of any peptide which shares one or more antigenic 
- determinants of the Cdk polypeptide. 

A further technique for drag screening involves the use of host eukaryotic cell lines or cells 
(such as described above) which have a nonfunctional MTS gene. These host cell lines or cells are 
defective in cell cycle control at the Cdk level. The host cell lines or cells are grown mine presence 
of drug compound. The rate of growth of the host cells is measured to determine if the compound 
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is capable of regulating the cell cycle. One means of measuring the growth rate is by dete rmining 
the biological activity of the Cdks, preferably Cdk4. 

Methods of Use; Rational Dray pfifflgn 
5 The goal of rational drug design is to produce structural analogs of biologically active 

polypeptides of interest or of small molecules with which they interact (eg, agonists, antagonists, 
inhibitors) in order to fashion drugs which are, for example, more active or stable forms of the 
polypeptide, or which, e.g, enhance or interfere with the function of a polypeptide in vivo. See, 
e.g., Hodgson, 1991. In one approach, one first determines the three-dimensional structure of a 
10 protein of interest (e.g, pl6 or Cdk4) or, for example, of the Cdk4-pl6 complex, by x-ray 
crystallography, by computer modeling or most typically, by a combination of approaches. Less 
often, useful information regarding the structure of a polypeptide may be gained by modeling 
based on the structure of homologous proteins. An example of rational drug design Is the 
development of HIV protease inhibitors (Erickson et aL, 1990). In addition, peptides (e.g, pl6 or 
15 Cdk4) are analyzed by an alanine scan (Wells, 1991). In this technique, an amino add residue is 
replaced by Ala, and its effect on the peptide's activity is determined. Each of the amino acid 
residues of the peptide is analyzed in this manner to determine the important regions of the peptide. 

It is also possible to isolate a target-specific antibody, selected by a functional assay, and 
then to solve its crystal structure. In principle, this approach yields a pharmacort upon which 
20 subsequent drug design can be based. It is possible to bypass protein crystallography altogether by 
generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. 
As a mirror image of a mirror image, the binding site of the anti-ids would be expected to be an 
analog of the original receptor. The anti-id could then be used to identify and isolate peptides from 
banks of chemically or biologically produced banks of peptides. Selected peptides would then act 
25 as the phannacore. 

Thus, one may design drugs which have, e.g, improved MTS activity or stability or which 
act as inhibitors, agonists, antagonists, etc of MTS activity. By virtue of the availability of cloned 
MTS sequences, sufficient amounts of the MTS polypeptide may be made available to perform 
such analytical studies as x-ray crystallography. In addition, the knowledge of the MTS protein 
3 0 sequence provided herein will guide those employing computer modeling techniques in place of, or 
in addition to x-ray crystallography. 
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M*thnd« of Use! (W Thmpv 

According to the present invention, a method is also provided of supplying wild-type MTS 
function to a cell which carries mutant MTS alkies. Supplying such a function should suppress 
S neoplastic growth of the recipient cells. The wild-type MTS gene or a part of the gene may be 
introduced into the cell in a vector such that die gene remains extrachromosomaL In such a 
situation, the gene will be expressed by the cell from die extrachromosomal location. If a gene 
portion is introduced and expressed in a cell carrying a mutant MTS allele, the gene portion should 
encode a part of the MTS protein winch is required for non-neoplastic growth of the cell. More 

10 preferred is the situation where the wild-type MTS gene or a part thereof is introduced into the 
* mutant cell in such a way that h recombines with the endogenous mutant MTS gene present in the 
cell Such recombination requires a double recombination event which results in the correction of 
the MTS gene mutation. Vectors for introduction of genes both for recombination and for 
extrachromosomal maintenance are known in the art, and any suitable vector may be used. 

15 Methods for introducing DNA into cells such as electroporation, calcium phosphate co- 
precipitation and viral transduction are known in the art, and the choice of method is within the 
competence of the routineer. Cells transformed with die wild-type MTS gene can be used as 
model systems to study cflnwT remission and drug ti eatments which prom ote s uch remission. 

As generally discussed above, the MTS gene or fragment, where applicable, may be 

20 employed in gene therapy methods in order to increase the amount of the expression products of 
such genes in cancer cells. Such gene therapy is particularly appropriate for use in both cancerous 
andpre-cancerous cells, in which the level of MTS polypeptide is absent or diminished compared 
to normal cells. It may also be useful to increase the level of expression of a given MTS gene even 
in those tumor cells in which the mutant gene is expressed at a "normal" level, but the gene product 

25 is not fully functional. 

Gene therapy would be carried out according to generally accepted methods, for example, as 
described by Friedman in Therapy for Genetic Disease, T. Friedman, ed., Oxford University Press 
(1991), pp. 105-121. Cells from a patient's tumor would be first analyzed by the diagnostic 
methods described above, to ascertain the production of MTS polypeptide in die tumor cells. A 

3 0 vims or plasmid vector, containing a copy of the MTS gene linked to expression control -elements 
and capable of implicating inside the tumor cells, is prepared. Suitable vectors are known, syckas . 
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disclosed in U.S. Patent 5,252,479 and PCT published application WO 93/07281 The vector is 
then injected into the patient, either locally at the site of the tumor or systemically (in order to reach 
any tumor cells that may have metastasized to other sites). If the transacted gene is not 
permanently incorporated into the genome of each of the targeted tumor cells, the treatment may 
have to be repeated periodically. Since MTS polypeptides are intimately involved in the control of 
the cell cycle, h is preferred that the MTS gene be introduced with its own regulatory elements, to 
avoid constitutive expression of MTS polypeptide by all cells which take up the gene. 

Gene transfer systems known in the art may be useful in the practice of the gene therapy 
methods of the present invention. These include viral and nanviral transfer methods. A number of 
viruses have been used as gene transfer vectors, including papovavinises (e.g^ SV40, Madzak et 
aL, 1992), adenovirus (Berkner, 1992; Berkner et aL, 1988; Goraglia and Kapikian, 1992; Quantin 
et al., 1992; Rosenfeld et aL, 1992; Wilkinson el aL, 1992; Stratford-Perricaudet et aL, 1990), 
vaccinia virus (Moss, 1992), adeno-assodated virus (Muzyczka, 1992; Ohi et aL, 1990), 
herpesviruses including HSV and EBV (Margolskee, 1992; Johnson et al n 1992; Fink et aL, 1992; 
Breakfield and Geller, 1987; Freese et aL, 1990), and retroviruses of avian (Brandyopadhyay and 
Temin, 1984; Petropoulos et al.. 1992), murine (Miller, 1992; Miller et aL, 1985; Sorge et al., 
1984; Mann and Baltimore, 198S; Miller et al., 1988), and human origin (Shimada et aL, 1991; 
Helseth et aL, 1990; Page et aL, 1990; Buchschacher and Panganiban, 1992). Most human gene 
ther?r>y protocols have been based on disabled murine retroviruses. 

Nonviral gene transfer methods known in the art include chemical techniques such as 
calcium phosphate copretipitation (Graham and van der Eb, 1973; Pellicer et aL, 1980); 
m e cha nic al techniques, for example microinjection (Anderson et aL, 1980; Gordon et al., 1980; 
Brinster et al, 1981; Constantini and Lacy, 1981); membrane fusion-mediated transfer via 
liposomes (Feigner et aL, 1987; Wang and Huang, 1989; Kaneda et aL 1989; Stewart et al., 1992; 
Nabel et aL, 1990; Lim et aL, .1992); and direct DNA uptake and receptor-mediated DNA transfer 
(Wolff et aL, 1990; Wu et al., 1991; Zenke et aL, 1990; Wu et aL, 1989b; Wolff et al., 1991; 
Wagner et aL, 1990; Wagner et aL, 1991; Cotten et aL, 1990; Curiel et aL, 1991a; Curiel et aL, 
1991b). Viral-mediated gene transfer can be combined with direct in vivo gene transfer using 
liposome delivery, allowing one to direct the viral vectors to the tumor cells and not into the 
surrounding nondividing cells. Alternatively, the retroviral vector producer cell -line' can be 
injected into tumors (Culver et aL, 1992). Injection of producer cells would then prpyicte^a.. 
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continuous source of vector particles. This technique has been approved for use in humans with 

inoperable brain tumors* 

In an approach which combines biological and physical gene transfer methods, plasmid 
DNA of any sis is combined with a polylysine-conjugated antibody specific to the adenovirus 
5 hexon protein, and the resulting complex is bound to an adenovirus vector. The trimolecular 
complex is then used to infect cells. • Hie adenovirus vector permits efficient binding, 
inte rnalizati on, and degradation of the endosome before the coupled DNA is damaged. 

Liposome/DNA complexes have been shown to be capable of mediating direct in vivo gene 
transfer. While in standard liposome preparations the gene transfer process is nonspecific. 
10 localized in vrvo uptake and expression have been reported in tumor deposits, for example, 
following direct in situ administration (Nabel 1992). 

MdhQdS Qf Usg - Peptide Therapy 

Peptides which have MTS activity can be supplied to cells which cany mutant or missing 

15 MTS alleles. Hie sequences of the MTS proteins are disclosed (SEQIDN03, SEQ ID NO: 14 and 
SEQ ID NO: 16). Protein can be produced by expression of the cDNA sequence in bacteria, for 
example, using known expression vectors. Alternatively, MTS polypeptide can be extracted from 
MTS-produdng mammalian* cells. In addition, the techniques of synthetic chemistry can be 
employed to synthesize MTS : jrotein. Any of such techniques can provide the preparation of the 

20 present invention which comprises the MTS protein. The preparation is substantially free of other 
human proteins. This is most readily accomplished by synthesis in a microorganism or in vitro. 

Active MTS molecules can be introduced into cells by nricroiiyection or by use of 
liposomes, for example. Alternatively, some active molecules may be taken up by cells, actively or 
by diffusion. Extracellular application of the MTS gene product may be sufficient to affect tumor 

25 growth. Supply of molecules with MTS activity should lead to partial reversal of the neoplastic 
state. Other molecules with MTS activity (for example, peptides, drugs or organic compounds) 
may also be used to effect such a reversal. Modified polypeptides having substantially similar 
function are also used for peptide therapy. 

30 - 
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MffthP H * Qf Use; Inosfonned Hosts 

Similarly, ceils and animals which cany a mutant MTS allele can be used as model systems 
to study and test for substances which have potential as therapeutic agents. The cells arc typically 
cultured epithelial cells. These may be isolated from individuals with MTS mutations, either 
5 somatic or gennline. Alternatively, the cell Kne can be engineered to cany the mutation in the 
MTS allele, as described above. After a test substance is applied to the cells, the neoplastically 
transformed phenotype of die cell is determined. Any trait of neoplastically transformed cells can 
be assessed, including anchorage-independent growth, tumorigenichy in nude mice, invasiveness 
of cells, and growth factor dependence. Assays for each of these traits are known in the art 

l o Animals for testing therapeutic agents can be selected after mutagenesis of whole animals or 

after treatment of gennline cells or zygotes. Such treatments include insertion of mutant MTS 
alleles, usually from a second animal species, as well as insertion of disrupted homologous genes. 
Alternatively, the endogenous MTS gene(s) of the animals may be disrupted by insertion or 
deletion mutation or other genetic alterations using conventional techniques (Capecchi, 1989; 

15 Valancius and Smithies, 1991; Hasty et aL, 1991; Shinkai et aL, 1992; Mombaerts et al., 1992; 
Philpott et aL, 1992; Snouwaert et aL, 1992; Donebower et aL, 1992). After test substances have 
been administered to the animals, the growth of tumors must be assessed. If the test substance 
prevents or suppresses the growth of tumors, then the test substance is a candidate therapeutic 
agent for the treatment of the cancers identified herein. 

20 

The present invention is described by reference to the following Examples, which are offered 
by way of illustration and are not intended to limit the invention in any manner. Standard 
techniques well known in the ait or the techniques specifically described below were ntiliTed. 

25 EXAMPLE! 

Materials and Methods 

A. MTS Pedigrees 

Figures 1A-1D show Kindreds 3137, 3161, 3355 and 1771, respectively. The occurrence of 
30 cancer in these kindreds is shown in the Figures. All melanomas in kindred 3137 xarry the 
susceptible haplotype, and other cancers carrying the susceptible haplotype ere jlso shown fpr this 
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kindred All melanomas in kindreds 3161 and 3355 cany the susceptible haplotype. A mutation in 
MTS was identified-for the cancers in kindred 1771. 

ft.TiimftrCril Lines 

5 Seventy-six m^^n^r^ cell lines were obtained from the Ludwig Institute for Cancer 

Research, Memorial Sloan-Kettering Cancer Center, and 8 melanoma cell lines and five non- 
melanoma lines from the American Type Culture Collection (ATCQ. 

C Preparation and analysis of tumor cdl line DNA 

10 DNA was isolated from cell lines by the addition of approximately lxlO 7 cells to 3 ml lysis 

buffer (0.1 M NaCl; 0.1M TrisHQ pH8.0; 5 mM EDTA; 0.5% SDS\ followed by vortexing and 
incubation at 65° C for 30 minutes. 0.5 ml of 8M KOAc was added, and the reaction was mixed 
and incubated on ice for 30 Hr™*»« After oentrifugation (five minutes at 10,000 x gX the 
supernatant was precipitated with an equal volume of 95% ethanol and centrifuged again (15 

1 5 minutes at 1 0,000 x g). The DNA was resuspended in 50-200 ml H 2 0. 

P. PCR Reactions 

50 ng template was added to 30 pmol of each oligonucleotide primer in a 20 ml reaction 
mixture that contained 0.1 mM dNTPs, 10 mM Tris-HCl (pH8J), 50 mM KC 1 , 2mM MgCl* 
20 0.01% gelatin, and 1 unit Amplitaq polymerase (Peridn-Ehner). Samples were cycled in a Perkin- 
Elmer 9600 thermal cycler 35 times at 94° C for 10 seconds, 55° C for 10 seconds, an^ 
10 seconds. The products were visualized after electrophoresis through either 1.5% agarose 
(SeaKem) or 3% NuSieve 3:1 agarose (FMC BioProducts) by ethidium bromide staining. 

25 EXACa 

Yeast artificial chromosomes (YACs) containing markers in the MTS region were obtained 
by screening the CEPH YAC libraries with IFNA, D9S171, and D9S126 using PGR conditions 
described above. Yeast strains containing YACs were grown at 30° C for three days with vigorous 
shaking in AHC medium (10 g/1 casein hydrolysate-acid; 1.7 g/1 yeast nitrogen base; 5 g/1 
30 ammonium sulfite; 20 mg/1 adenine hemisulfete; 2% glucose; pH- 5.8). Yeasty DNA was 
prepared as deggibed by Ausubel et al., 1992. 
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F. Phage Library Construction 

Yeast genomic DNA containing YAC DNA was digested to completion with BamHI, 
inserted into BamHI-digested EMBL3 phage aims (Promega) using T4 DNA ligase (Boehringer- 
S Mannheim), and packaged in vitro with Gigapack II extracts (Stratageoe). Phage were grown on 
£ colt strain C600. Recombinant phage containing human DNA were identified by hybridization 
with n P-labeled human C 0 t-1 DNA (GDBCO-BRL). Phage including human sequences joined to 
YAC vector (end clones) were identified by screening with PGR fragments containing sequences 
from the YAC left or right aim. Hybridization and washing were carried out under standard 
10 conditions (Middleton et aL, 1991). Positive plaques were picked and purified by replating three 
* times. Phage DNA was prepared using Qiaex columns (Qiagen). 

G. Coamid Library Construction 

Yeast genomic DNA containing YAC DNA was digested partially with Sau3A and 
15 fractionated by size on a linear (10-40%) sucrose gradient, as described in Maniatis et al., 1982. 
SuperCos 1 cosmid vector (Stratagene) was prepared according to manufacturer's directions, mixed 
with insert DNA at a mass ratio of 4:1 (insert-vector), treated with ligase, and packaged in vitro, as 
described above. Cosmids were introduced into DH5a host cells and plated at a density of 2000 
colonies per IS cm petri dish. Colony hybridization was carried out as described above and in 
20 Maniatis etah, 1982. 

h. pi dona 

PI clones spanning the MTS region were obtained from Genome Systems, Ino, St Louis, 
Missouri, by screening with STSs prepared as described herein. DNA from these clones was 
25 isolated by alkaline lysis (Bimboim and Doly, 1979), followed by cesium chloride gradient 
centrifugation (Maniatis et aL, 1982). 

I. Generation nfSTfig 

STSs were generated by sequencing 1.0 mg of PI, cosmid, or template DNA with 
3 0 oligonucleotides complementary to sequences flanking the cloning site of the PI vector^pSacBO), 
SuperCos 1 vector, or the EMBL3 vector. Sequencing was done on an ABI 373 A DNA seque ncing 
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systezn with the PRISM Ready Reaction DyeDeoxy Terminator Cycle Sequencing Kit (ABI). 
STSs were designed to be as close as possible to 20 bp long and to have a Tin as close as possible 
to60°C 

5 I flgmlinC matatiana in MTSI in mclimnmiuptTme ImAeda. 

Genomic DNA from carrier individuals was prepaied from blood using standard methods. 
Primers were designed at intron positions to amplify coding exons 1 or 2 from MTS1, or coding 
exon 2 from MTS2 using 20 ng DNA from each sample. PCR reactions used the standard buffer, 
except DMSO was added to a final concentration of 5%. Cycle sequencing reactions were carried 
10 out using a-P^-dATP on the amplified products using primers positioned at di fferent points in the 
sequence. The sequencing products were analyzed on 6% denaturing polyacrylamide gels by 
loading all the (A) reactions side by side, followed by the (Q reactions, etc All polymorphisms 
were confirmed by sequence analysis of the opposite strand 

15 EXAMPLE 2 

Localization ofMTS Using 
(tactically Linked Mates 

To analyze tumor cell lines for homozygous deletions in the 9p21 region, a set of markers 
20 known to be linked to MTS was utilized. These markers were used originally to demonstrate 
dramatic linkage (LOD score « 12.7) of melanoma predisposition in 10 Utah kindreds and one 
Texas kindred (Cannon-Albright et aL, 1992). The markers included a sequence from the 
a-interferon gene cluster (IFNA) (Kwialkowski & Diaz, 1992) which was the most distal marker 
tested, a proximal marker (D9S104), and four additional markers in between (D9S171, D9SI26, 
25 D9S161, and D9S169) (Cannon-Albright et aL, 1992). From genetic studies, the linear sequence 
of the intervening markers was thought to be: D9S171, D9S126, D9S161, D9S169. The IFNA 
marker consisted of an oligonucleotide primer pair thai amplified two fragments from wildtype 
genomic DNA: a roughly 138-150 bp polymorphic fragment (DFNA-1) that contains a poly(CA) 
stretch, and a roughly 120 bp invariant fragment (IFNA-s). The location of IFNA-s with respect to 
30 IFNA-1 was unknown. 
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Five non-melanoma tumor cell lines reported previously to contain deletions were analyzed 
using genetic markers. Each cell line revealed homozygous deletions of at least one of the markers 
tested (Table 1). No homozygous deletions were identified using D9S161, D9S169 or D9S104. 
The minimum region of overlap among these deletions was flanked by IFNA-1 and D9S171. This 
5 suggested that die region between these two markers contains a geae(s) that is involved in tumor 
suppression, possibly MTS. The genomic region between D9S171 and IFNA-1, particularly in the 
vicinity of IFNA-s, was then studied in further detail 

TABLE 1 

10 Homozygous Deletions in Tumor Cell Lines 

Detected with Genetic M«*er« Linfcerf tn MTS 



Tumor Markers 
15 Cell 

Lines — IFNA-1 IFNA-s rwsni pgsi2fi hqsmi dosict dqsku 

U-138 - ... - + + + 

U-118 - . . + + + 

20 U-87 • + + + + + 

A-172 + . + + + + + 

H4 - • • + + + 

NOTE: All cell lines are either gliomas or neuroblastomas 
2 5 available from the ATCC 



30 



EXAMPLE 3 

Genomic Clones in the MTR Regirm 



To obtain genomic clones of the region surrounding IFNA-s, CEPH YAC libraries were 
screened (Cohen el aL, 1993). Eleven YACs were identified which contained the D9S171 marker 
and 5 that contained IFNA-s. No YACs were isolated which included both D9S171 and IFNA-s 
(Figure 2). Three of the YAC clones (C9, C6, F9) were subcloned into phage and one YAC (C6) 
35 was subcloned into a cosmid vector. These and cosmid clones provided a convenient* way to 
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produce STSs internal to known genetic markers and to expedite the chromosomal walk described 
below. 

To provide an independent- source of genomic DNA for construction of a contiguous 
genomic map of the region and to aid in production of STSs, a chromosomal walk was initiated in 
PI clones from IFNA-s extending toward D9S171. from D9S171 extending back toward IFNA-s, 
and from the YAC C6 ends in both directions A total of 27 PI clones were isolated as part of this 
chromosomal walk (Figure 2). The iordered Pis formed a contiguous assembly trial stretched from 
IFNA-s to D9S1 71 with two gaps. PI clones as well as several phage and cosmid clones were used 
to generate a fine structure map oftheMTS region. 

EXAMPLE 4 
Pj nft ancjuB Analgia of mts Regan 

To construct a more detailed molecular map of the MTS region, additional markers were 
required. DNA sequences obtained fitom the genomic clones were used to design PCR primers for 
STSs. These STSs served in turn to help order the PI and YAC clones. A total of 54 STSs from 
the region between IFNA-s and D9S171 were the primary basis for developing a detailed physical 
map of the MTS region (Figure 2). These STS primer sequences have been deposited in the 
Genome Database. 

The set of new markets stretching from IFNA-s to D9S171 was used to test 84 melanoma 
cell lines for homozygous deletions in the MTS region. A total of 52 lines revealed regions of 
homozygous deletion (Figure 3). Several of the deletions were extensive; for example, 13 lines 
were missing a region that included both 816.7 and 760-L. 

For the purpose of localizing MTS, the most mfrnmarive tumor lines fell into two groups 
(Figure 3)r i) those that contained deletions of c5.1 alone (class 1 1); and, ii) those that contained 
deletions ofc53 alone (class 12). A total of 5 melanoma lines fell into these categories. In all 
cases where deletions were detected, the deletion appeared to be simple; that is, there was no 
evidence of multiple deletion events in the region between IFNA-s and D9S171. Together the 
lines harboring deletions delineated a region of deletion overlap centered around markers c5.1 and 
c5 3, making the development of a complete physical map of the region from IFNApsJo D9S171 
unnecessary? ~ ■ ~ — • \z*&.~ '.'•■„•.• 
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EXAMW.KS 
Identification of Cosmid c5 and PI 

Cfliflnig and PI ACT M Containing MTS 

S 

A. Placement of genetic markers 

Analysis of YAC clones and- of deletions in tumor lines yielded results consistent with the 
genetic placement of markers: EFNA, D9S171, D9S126. Three YACs contained both D9S171 and 
D9S126, while four YACs contained IFNA-1 and IFNA-s (Figure 2). None contained both D9S171 

10 and EFNA. This suggested mat: i) IFNA-1 and IFNA-s are closely linked, and ii) D9S126 and 
D9S171 are linked. These results were confirmed by cell line deletions. Most cell lines that were 
missing D9S171 also lacked D9S126. Conversely, line U-87, although testing positive for D9S171 
and D9S126, lacked IFNA-s and IFNA-1 (Table 1). One melanoma cell line, SK-MEL-5, lacked 
IFNA-s, D9S171 and D9S126, but not IFNA-1. Thus, IFNA-1 must be distal to IFNA-s. Another 

15 melanoma line, SK-Mel-Zan, contained a deletion that included IFNA-1, IFNA-s and D9S171 but 
not D9S126, placing D9S171 between IFNA-s and D9S126. Collectively, these findings support 
the marker order given in Table 1. 

The human a-interferon gene family consists of over 23 genes and pseudogenes located on 
chromosome 9p. This gene cluster has been cloned and sorted into 10 linkage gr ups (Henco et 

20 aL, 1985). The linkage groups have been partially ordered by analysis of deletion losses of 
different a-interfcron sequences in glioma cell lines (Olopade et al., 1992). Glioma line H4 lacks 
both IFNA-1 and IFNA-s. It also lacks sequences from linkage group IV al3, 06 and o20). 
Glioma line A172 contains both IFNA-1 and linkage group IV, but lacks sequences from linkage 
groups I (e.&, al, a 19), m (e.g^ at) and DC (eg., o2) and IFNA-s. This analysis places IFNA-1 

25 distal to linkage groups I, in, and DC, as well as IFNA-s. The distal boundary of the A172 deletion 
was mapped within one PI length distal of IFNA-s. Thus, linkage groups I. ID, and DC must lie 
proximal to a point located less than 85 kb distal to IFNA-s. 

B. Physical distance between genetic markers 

30 The results did not permit a precise estimate of the distance between IFNA-s and D9S171, 

since it was not .possible to isolate YACs that contained both markers. Furthermore, bas ed on 



WO 95/23813 



PCT/CS95/03537 



-51- 

mapping with STSs, none of five YACs that extended distal from IFNA-s overlapped any of the 1 1 
YACs that extended proximal from D9S171. Given that CEPH YAC inserts average under 500 kb 
in length, the distance between IFNA-s and D9S171 is likely to be at least this large. 

The region between IFNA-s and c5.1 was covered by nine walking steps in a PI library. 
Ass umin g that each step is on average the length of half a PI insert, the distance between cS.I and 
IFNA-s is roughly 400 kb. Thus, the tumor suppressor gene tightly linked to c5.I which is deleted 
frequently in melanoma lines must lie about 400 kb proximal to IFNA-s. 

C. Deletions in tumor lines 

Homozygous deletions of the 9p21 region woe found in 57% of the melanoma tumors 
tested. Fourteen tumor lines contained deletions that extended on the pr oximal side through 76G-L, 
and 16 lines contained deletions that stretched beyond 816.7 on the distal side. Assuming the 
deletions are causal, that is, deletions of gene(s) in Ads region contribute to the tumor phenotype, 
the tumor suppressor gene(s) must also lie between 760-L and 816.7. Hie smallest deletions 
involved markers cS.l and cS.3. Of all of the markers tested, c5J was deleted from the largest 
number of lines, 51. Therefore, the most probable position of the tumor suppressor gene(s) is very 
close to cS3 because it is the most frequently deleted marker. Four lines contained deletions of 
cS.3 alone (class 12) and one line lacked cS.l alone (class 11). Both of these markers were present 
on the same cosmid, c5. Thus, it is likely that the tumor suppressor gene(s) includes sequences 
from cosmid cS. PI clones P1062 and P1063 include sequences found in cS and surrounding 
cosmids. TTius, as shown further below,P1062 and PI 063 contam the entire MTS region. 

The results presented in the above Examples are consistent with previous genetic studies of 
MTS, which found the region between IFNA-1 and D9S126 to be the most probable location for 
MTS (Cannon-Albright et aL, 1992). Recent genetic studies have confined the location of MTS 
further using a polymorphic (CA) repeat that lies between IFNA-s and C53 on Pl-452 (Figure 2). 
Analysis of a recombinant chromosome using this marker places MTS proximal to Pl-453. Thus, 
MTS maps within the region where homozygous deletions in melanoma cell lines cluster. 

These results support the view that there is a tumor suppressor locus, MTS, positioned 
somewhere near c5J. All the lines that contained deletions shared a common area of deleted 
DNA, with the-exception of the set whose deletions were restricted to c5.1 or oS3 (cIassesJ4-_and 
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12). There was no indication of non-overlapping deletions in this panel of cell lines other than 
those within cosmid d Therefore, there is no basis to invoke a more complex scheme involving, 
for example, a second tumor suppressor locus in 9p21 distant from cS.l and c5 J. 

The observation that homozygous deletions of 9p21 occur in multiple tumor types suggests 
that the tumor suppressor gene(s) located there may be expressed in a wide variety of tissues. Thus, 
the tumor suppressor gene(s) may be similar to the p53 gene in that it may participate in the 
development of multiple types of cancer (see further data below). Other types of cancer have been 
reported in melanoma-prone families (Nancarrow et aL, 1993; Bergman et aL, 1990). A thorough 
deletion analysis of a wide variety of tumor types using cS.l and cS3 (shown below) clarifies the 
importance of this tumor suppressor gene in tumors QtW thm* melanoma 

Some of the homozygous deletions observed remove many genetic markers; Fo untain et aL 
reported that homozygous deletions of chromosome 9p21 in two d i f ferent melanoma lines 
extmded 2-3 Mb (Bergman et aL, 1990). In this study, at least one line, SK-MEL-5, contained 
deletions extending from the most distal marker tested* IFNA-L past D9S126, a region apparently 
too large to be contained on a single YAC The preponderance of large deletions suggests that the 
region surrounding MTS is devoid of genes that are essential to cellular viability. 

EXAMPLE 6 
Isolation ofMTS c*™ka«* 

In the previous Examples, the results of a YAC and PI ' chromosomal walk in the 
neighborhood of MTS were described. Fine structure-mapping experiments with STSs derived 
from cS sequences showed the presence of small, noo-overiapping deletions of c5 sequences in five 
different mel an o ma cell lines. Based on this result, h was probable that a tumor suppressor gene, 
possibly MTS, lay at least partly within c5. 

A further indication that cS contained at least one gene came from analysis of (CpG) 
dinucleotide frequencies in cS and neighboring cosmids. In ™rnmml? t virtually all housekeeping 
genes and nearly half of all tissue-specific genes are ^f^ciittfd with regions unusually rich in 
(CpO) dinucleotides (Bird, 1989; Larsen et aL, 1992). Thus, the presence of such "CpG islands" is 
indicative of genes. Cosmids cS, cl2, c57, and c59 were digested with the. restriction 
endonucleases EagI, BssHL and SacII, enzymes whose recognition sequences include two (CpQ) 



WO 95/25813 



PCT/OS95/03S3T 



-53- 

pairs. Only cosmids c5 and cl2 contained sites for these enzymes. Cosmid c5 contained one EagI 
site, at least 10 BssH sites, and at least 12 SacH sites. The presence of the CpO islands in c5 and 
the overlapping cosmid cl2 suggested that c5 indeed contained at least one candidate gene for 
MIS. 

To search for MTS, the DNA sequences of EcoRI fragments from cosmid c5 were 
determined When these sequences were compared against sequences from GenBank, two distinct 
regions of cS were identified that wise similar to a region of a previously identified gene encoding 
h uman cyclin-dependent kinase 4 (Cdk4) inhibitor, or pl6 (Serrano et at, 1993). These two genes 
'were candidates for MTS, and were named MTS1 and MTS2. MTS1 was located near the end of 
cosmid c5 closest to the chromosome 9p telomere, while MTS2 was located near the centromeric 
end of c5. See Figure 4B. A cosmid map showing the position of MTS1 and MTS2, as well as Pis 
1062, 1063 and 1069 is shown in Figure 4A. 

A detailed comparison of genomic sequence of MTS1 from c5 with the pl6 mRNA 
sequence revealed that MTS1 contained a stretch of 307 bp that was identical to a portion of the 
pl6 coding sequence. This stretch of nucleotides in MTS1 was flanked by recognizable splice 
junction sequences. Further characterization of MTS 1 showed that it included the entire coding 
sequence of pl6 plus two introns (Figures 5 A and 5B and Figures 6A and 6B). Intron 1 was 
l oca ted 126 bp downstream from the translational start site; intron 2 was located 11 bp upstream 
from the translational stop site. The two introns divided die coding sequence of MTS 1 into three 
regions, a 5' region of 126 bp (coding exon 1), a middle region of 307 bp (coding exon 2), and a 3' 
region of 11 bp (coding exon 3). SEQ ID NO:3 sets forth nucleotide sequence for the 5* region, 
exon 1 and part of intron 1 for MTS 1. SEQ ID NO:4 sets forth the nucleotide sequence for part of 
intron 1, exon 2 and part of intron 2 forMTSl. 

MTS2 contained a region of DNA sequence nearly identical to pl6 that extended from the 5' 
end of coding exon 2 roughly 211 bp toward intron 2 (Figure 7A). However, the sequence 
similari ty decreased until a point 51 bp upstream of intron 2 in MTS1 which corresponds to the 
location of the final codon of MTS2 (Figure 8). Comparison of sequences from MTS1 and MTS2 
(Figure 8) showed that the sequence similarity between these two genes also extended nearly 40 
nucleotides upstream from the 3* splice junction of intron 1. Thus, portions of noncoding DNA 
were more conserved than some areas of presumptive coding DNA. To exclude the possibility that 
the sequence divergence in coding DNA might be a cloning artifact, PCR primers were designgj-to 
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amplify specifically across the sequence divergence point of MTS2. These primers amplified a 
fragment of the predicted sire from cosmid, PI and genomic DNA. Therefore, the divergent 
sequence located near the 3' end of exon 2 in MTS2 is a bona fide genomic sequence. SEQ ID 
NO:5 sets forth the nucleotide sequence for part of intron 1, "exon 2" and "intron 2" for MTS2. 
SEQ ED NO: 15 sets forth the cDNA sequence for MTS2. 

The occurrence of two closely related genes on cosmid c5 suggested that other related genes 
might exist in this region. To test this possibility, Southern blots were prepared from restriction 
enzyme digests of cosmids c5, cl2, c59, Pis 1063 and 1060, and human genomic DNA- These 
blots were probed with a fragment containing most of exon 2 from MTS1, including the region 
shared with MTS2. Two EcoRl fragments were detected with the probe in both cloned DNA and 
genomic DNA. This result was consistent with the presence of two pl6-like genes in die genome, 
MTS1 and MTS2. It is also consistent with the now known presence of MTSlEip which is an 
al ternate fonn of MTS1 - containing Exons 2 and 3 but not Exon 1 ofMTSl. 

EXAMELLZ 

ISOlatiCD. nnd Stnicture ofMTSl El fl 

T^nlntionnfMTSIElfl 

Clones that contained MTSlEip were isolated by hybrid selection using the complete MTS1 
cDNA as a probe and by conventional cDNA library screening. Conventional cDNA library 
screening was perfonned using a probe derived from exon 2 ofMTSl. One million clones were 
screened from each of fetal brain, normal breast and lymphocyte-derived libraries. A hybridizing 
cDNA clone was isolated from the lymphocyte library. Hie clone was sequenced and shown to 
contain Eip. It also contained exon 2 (E2) and exon 3 (E3) ofMTSl. Hybrid selection-derived 
cDNA clones were isolated by incubating cDNA derived from ovarian tissue with cosmid cS. The 
cosmid was labeled with bioun and made to be single-stranded. Hybrids between c5 and the 
cDNA were allowed to form and then the Hotinylated cosmid was captured using streptavidin- 
coated magnetic particles. The selected cDNA was eluted from the cosmid, amplified by PCR, 
cloned and sequenced. The cDNA clones were similar to those isolated by library screening in that 
they contained El P, E2 and E3. None of the clones contained the previously described^Xon 1 (see 
SEQ ID N03J. The sequence for MTSlEip cDNA is set forth vs. SEQ ID N0:13.— .. . 
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^mirTnrrr fMTS1ElB 

MTSl and MTS lEip are two fonns of a single gene: the two forms both utilize exons 2 and 
3 but have different first exons. MTSl contains the a form (Ela) which encodes the first 43 
amino acids of die pl6 protein encoded by MTSl. MTSlElp contains die p form (Eip) ofexon 1. 
5 The exon structure of die pi 6 gene was determined by comparing die sequence of the composite 
cDNA clones to the genomic regions from which they were derived (Figure 13). A combination of 
genomic Southerns, sequence analysis of the genomic region containing P16, and long PGR were 
used to map the positions of the P16 exons (Figure 13). The pl6 gene spans approximately 30 kb 
of genomic DN A. El P is the most 5' of the exons, the order being Elp, Ela, E2 and E3. 
10 Translation of Elp in the pl6 reading frame (extrapolated from the reading frame used in 

1 pl6 coding exons 2 and 3) revealed an in-frame stop oodon positioned only 10 codons upstream of 
the splice junction between Eip and El The position of the stop codon was confirmed by 
genomic and cDNA sequence analysis. The first potential initiation codon, downstream of Ads 
stop was in the p!6 reading frame, immediately 3* of the E1/E2 splice junction. This potential start 
IS codon is flanked by sequences that do not closely resemble the consensus Kozak sequence (Kozak, 
1987). If translated in the p!6 reading frame, the Elp transcript of the pl6 gene would encode a 
protein of 1 05 amino adds. 

Additional analysis of the P cDNA revealed that it possessed a large ORF in a different 
frame than the one used to encode pl6. The ORF (referred to as ORF2) extended t jough Eip and 
20 continued for 67 amino acids into El The entire ORF could encode a protein of 180 amino acids. 
However, die reading frame remained open at the 5 9 end of E 1 p 9 and therefore, may be incomplete. 
S tati stical analysis suggested that an ORF of this size was unlikely to occur by chance in DNA 
composed of random sequence (P-0.003). However, given the base composition of Jhe_ P 
transcript, the probability was higher (PO.16). The predicted polypeptide was not similar to any 
25 previously described protein. 

Identifying the evolutionary conserved portions of Eip might provide clues as to what 
sequences are important for its function. Mouse pi6 cDNAa were isolated by a modified RACE 
technique called Hybrid Capture RACE (HCR) (see Example 12) and compared to the human pl6 
cDNAs. One type of mouse P16 cDNA (the P type) possessed an exon equivalent to human Eip 
30 and an E2 equivalent A second type (the a type) contained an Ela equivalent joined to E2v The 
Ela and E2 mouse exons were 70% identical to their human counterparts- The nuclgtide 
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sequence of the mouse and human Eip exons were 51% identical (Figure 14) and the mouse Eip 
exon also contained stop codons in the reading frame used to encode pi 6. The human and mouse 
polypeptides, deduced from the nucleotide sequence 5' of the stop codon, were completely 
divergent Therefore, it is unlikely that the stop codons in the p!6 reading frame were sequencing 
5 aitificts. 

Given the uncertainty regarding the role of Eip, we analyzed the similarity between the 
mouse and human p transcripts in all three reading frames. Hie mouse and human P transcripts 
contained a large ORF (ORF 2) in a di ffer e nt reading frame than the one used to encode pi 6(ORF 
1). The deduced polypeptides encoded by ORF 2 were 40% identical. However, they were only 

10 28% identical if we restricted the comparison of the ORF 2 peptides to the portion encoded by 
E10. In contrast the mouse and human pl6 sequences were 67% identical. In addition, the 
polypeptides deduced from ORF 2 contained in E2 were as similar (42%) as the polypeptides 
deduced from die third reading frame in E2 (ORF 3). These results suggest that ORF 2 has not 
been selectively maintained and probably does not encode a protein. The secondary structure of 

IS the human and mouse p RNAs were also compared. No striking similarities were identified. 
Collectively, these results suggest that the P transcript is required for PI 6 function by virtue of its 
presence in both mouse and man; and that if it is translated, the encoded protein probably initiates 
at the first methionine in exon 2 • 

20 EXAMPLES 

(Vnnlin.. Mtifatinn* in MTR1 

To teat whether or not MTS1 or MTS2 corresponded to the genetic susceptibility locus MTS, 
genomic DNA was analyzed from eight individuals presumed to cany MTS predisposing alleles 
25 (Cannon-Albright et aL, 1992). DNA sequences from the exons were amplified from each sample 
using oligonucleotide primers (Table 2) derived from intron sequences specific for either MTS1 or 
MTS2. . 
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Primes for Screening Exons in MTS1 


IF 


SEOIRNO: 


Exon 


Gene 


6 


1 


MTS1 


1108R 


7 


1 


MTS1 


42F 


8 


2 


MTS1 


551R 


9 


2 


MTS1 


21F 


10 


2 


MTS2 


50R 


11 


2 


MTS2 


89F 


12 


2 


MTS2 



Exon 1 of MTS1 was amplified using primers IF and 1108R and men sequenced using 
primer 1108R. Exon 2 of MTS1 was amplified using primers 42F and 551R and then sequenced 
using primers 42F and 55 1R Exon 2 of MTS2 was amplified using primers 21F and 50R, 
reamplified using primers 89F and 50R and then sequenced usmg primers 89F and 50R. 

The DNA sequences of these genomic fragments revealed polymorphisms in two of the eight 
individuals. The polymorphisms were not present in any of the other samples, suggesting that they 
were not common in the population. To demonstrate that the polymorphisms wens linked to the 
MTS chromosome and not to the other homolog, genomic DNA from other individuals who cany 
the predisposing allele from each kindred were analyzed. In each case, the polymorphisms 
segregated with the KfTS predisposing allele. The mutation at codon 93 (gly ftp) was found in 
an individual (12821) in kindred 3012. It was also found in affected earner sib (13183) and in 
unaffected carrier cousin (14917), but not in unaffected non-carrier sib (13184). The mutation at 
codon 1 18 (val -> asp) was found in an individual (15635) in kindred 1771. It was also found in 
affected carrier first cousin once removed (10205), affected carrier first cousin (11414) and in 
affected carrier first cousin of 10205 (10146), but not in unaffected non-carrier uncle of 10205 
(10120). 

The polymorphisms were single nucleotide substitutions mat caused amino add chan ges 
CTable 3). The substitutions' involved either the substitution of a large hydrophobic residue for 
small faydrophilic residue, or the substitution of a charged amino acid for a neutral amino acid. 
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IABLEi 
Predisposing Gennline MTS Mutation 



Coding 
Mutation . Effeqf 



G-»T gly-+trp 277 
T-*A val-+asp 353 



• Location of mutation in DNA sequence ofSEQ ID NO:I. 

Exon 2 from MTS2 showed no polymorphisms in the eight samples tested. This suggests 
mat, at least in this set of kindreds, MTS2 does not predispose to melanoma. It is possible that 
MTS2 is involved in other types of cancer based on its similarity to MTS1. It is also possible that 
MTS2 is a nonfunctional gene. 

ThefinoingofgamlhemiitatioiismKfrSl and not in MTS2 in individuals predisposed to 
melanoma is consistent with the analysis of melanoma homozygous deletions. 

20 EXAMPI,F,9 

Analysis nf ih* Epaenfifl »f \f rs fa Titmnr t ; in 

Because of the high frequency of deletions at 9p21 in multiple tumor types, cell lines derived 
from 12 different types of tumor were analyzed lor the presence or absence of NITO^ 

25 sequence-tagged sites (STSs) spaced across the geiie was used to test gemm^ 

cell lines for the presence or absence of the expected fragment (Figures 4A, 4B and 9). The results 
of this study suggested that MTS1 was deleted from a large percentage of tumor lines (Table 4). 
Homozygous deletions occurred in all tumor types tested other than colon and neuroblastoma cell 
lines, the percentage of deletions varied from a low of 25% in lung cancer and leukemia to 94% in 

30 astrocytomas. In total homozygous deletions were detected in 135 of 290 cell lines tested. This 
number yields a minimum estimate of the percentage of tumor lines that harbor deletions because 
the STSs used for the analysis did not cover the entire gene. Thus, certain small deletions could 
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have escaped detection. In addition, lesions such as insertions or deletions of a few nucleotides, 
and nucleotide substitutions, would be missed by this approach. 

TAMPA 

5 Homozygous Deletions mTUmor By Tumor Type 







No. 


No. 


% 








Deletions 




10 




99 


57 


58 




leukemia 


4 


1 


25 




lung 


59 


15 


25 




neuroblastoma 


10 


0 


0 




bladder 


15 


5 


33 


15 


renal . 


9 


5 


56 




astrocytoma 


17 


16 


94 




colon 


20 


0 


0 




breast 


10 


6 


60 




ovary 


7 


2- 


29 


20 


glioma 


35 


25 


71 




osteosarcoma 


_i 


-1 


fifi 




TOTAL 


290 


135 


47% 



25 To improve the estimate of the total somber of ce'l lines containing MTS1 mutations, 34 of 

the cell lines that did not suffer obvious homozygous deletions of MTS1 sequence were examined 
more closely for lesions in MTS1. Sequences comprising neatly 97% of the MTS1 coding 
sequence were amplified and screened for polymorphisms. Eighteen somatic mutations in exon 2 
or exon 1 of MTS1, distributed in 14 out of 34 melanomas, were observed (Table 5). Three of 

30 these mutations were ffomeshifts, 7 were nonsense mutations, 4 were tr iMM)tt mutations and 4 
were silent Three of the 4 lines that contained silent mutations also contained additional mutations 
and 16 of 18 mutations were located in coding exon 2* All but one line contained exclusively 
head- or homozygous polymorphisms, suggesting that the other homologous chromosomes had 
incurred deletions. The single line that was heterozygous contained two different nonsilent 

35 mutations, a finding consistent with the view that each homolog had undergone independent 
mutational events. Based on this DNA sequence and deletion analysis of MTS1, a minimum of 
75% of melanoma lines contained mutant MTS1 or had lost the gene from both homology . 
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TABLE5 
Somatic MTS Mutations in Tumors 



5 


Cell 




Coding 






Tina 




Effect 


Location* 




SK-M-ste 


G-+A 


none 


240 






0-+A 


gly-»ser 


241 


10 


SK-M«swi 


C-*T 


arg — ► stop 


148 


• 


SK-M-ris 


G-*A 


ala-^thr 


418 




SK-M-beh 


5 base deletion 




266-270 




SK-M-178 


C->T 


arg — ► stop 


214 




SK-M-sta 


G->A 


tip — ► stop 


306 


15 


• SK-M-uti 


C->T 


arg — ► stop 


214 




SK-M-EML131 


8 base deletion 


ftttBCflfalft 


148-155 






C-+A 




147 




SK-M-koz(net**) 


C->T 


pro -►leu 


317 






C-*T 


0000 


213 


20 




C-*T 


• arg —► stop 


214 




SK-M-kra 


C-*T 


pro -►leu 


317 




SK-M-kuu 


C->T 


none 


354 




SK-M-mar 


G->A 


op— ►stop 


305 




SK-M-wfai 


C->-T 


gin — ► stop 


124 


25 


SK-M-adl(het) 


2 base deletion 


fiameshift 


104.105 



* Location of mutation in DNA sequence of SEQ ID NO: 1 . 

Het stands for "heterozygote" and refers to the presence in the sample of both 
30 wildtype and mutant sequences. 

The preponderance of lesions in MTS1 (deletions and nucleotide substitutions) indicates that 
MTS1 or a closely linked locus contributes to the tumor phenotypc. Cells that suffer these lesions 
enjoy a selective advantage oyer cells that do not The alternative explanation, that the lesions are 

35 random events that have nothing to do with cell growth, is unlikely for several reasons. First, the 
high contlation between tumor phenotypc and mutation at MTS 1 implies a causal relation between 
MTS1 mutations and tumor formation. Second, MTS1 influences susceptibility to melanoma, and 
thus is implicated independently as a tumor suppressor gene* Third, the biochemical function of 
pl6 as a potent inhibitor of a Cdk neatly fits a model where pl6 acts in vtvo as a general inhibitor 

40 of the onset of DNA replication. 
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It is possible that mutation or loss of MTS1 is a product of cell growth in culture. However, 
a high percentage of primary leukemia cells also contain homozygous deletions of the a-interferon 
gene cluster, a gene family located less than 500 kb from MTS1 (Diaz et aL, 1990). Previous 
deletion studies suggest that deletions of a-interferon genes invariably involve markers that extend 
5 beyond MTS1 toward the centromere (Weaver-Feldhaus et aL, 1994). Because homozygous 
deletions of the MTS1 region occur in primary tumor cells as well as cultured cell lines, the 
deletions observed in tumor cell lines are unlikely to be purely an artifact of cell growth in culture. 
Nevertheless, the question of when MTS1 mutations occur during the progression of tumors will 
be answered best by analysis of primary tumor samples. 

10 

TTicrolcofMTSl in viva 

In all eukaryotic cells, cell division requires passage through two critical decision points: the 
Gl to S transition, where DNA synthesis commences, and the 02 to M transition, where mitosis 
begins. In mammals, the machinery that controls cell division has multiple components, many of 

15 which are related (for review see Shexr, 1993). The Cdks may be at the heart of the control 
apparatus, in that they regulate by phosphorylation a number of key substrates that in turn trigger 
the transition from 01 to S and from 02 to M. The 01 to S transition is perhaps the more critical 
decision point, as it occurs first in the cell cycle. So fir, four types of Cdk have been defined 
(Cdk2-5) that may participate in Gl to S control, as well as a set of positive regulators of these 

20 Cdks (cyclins C, Dl-3, E). Recently several negative regulators have also been identified, 
including pl6, plS, pl8, p20, p21, and p27 (Kong et aL, 1993; Serrano et aL, 1993; Gu et aL, 
1993; El-Diery et aL, 1993; Harper et aL, 1993; Harmon and Beat*. 1994; Polyak et aL, 1994b; 
Toyoshima and Hunter, 1994; Ouan et aL, 1995). These negative regulators appear to act by 
inhibiting the kinase activity of the CDKs. Some of the cell cycle regulators are involved in human 

25 cancers (for review, see Hunter and Pines, 1994). p20 inhibits Cdk2 and possibly other Cdks while 
pi 6 (also called MTS1, CDKN2, or INK4a) inhibits Cdk4 but apparently does not inhibit Cdk2 in 
an in vitro assay (Serrano et aL, 1993). Based on in vitro studies and on its interaction with pS3, 
p21 has been proposed as a general inhibitor of all Cdks (Xiong et aL, 1993). Thus, in vitro, p i 6 
appears more specific than p21. Each of these inhibitors is expected to antagonize entry into S 

30 phase. Also, cyclin Dl or CDK4 is overexpressed in some breast carcinomas and the pJ6 gene is 
mutated or deleted in a large number of cell lines and primary tumors (Buckley et aL, 1993; Calda s 
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et al., 1994; Kamb ct al., 1994b; Mori et aL, 1994; Tarn et al., 1994a). These results suggest that 
certain cyclins and CDKs are protooncogenes and that PI 6 (MTSI) is a tumor suppressor gene. 
The biochemical behavior of pl5, pi 8, p21 and p27 indicate that they too may be tumor 
suppressors, but detailed mutational analysis of their genes in tumors or cell lines has not been 
reported. The results presented here provide evidence that MTSI functions fa vivo as an inhibitor 
of cell division. 

The pl6 gene (MTSI), located in the 9p21 segment of human chromosome 9, is especially 
interesting because it is mutated or homozygouary deleted in a high percentage of some types of 
tumors and tumor-derived cell lines (Caldas et at, 1994; Kamb et aL, 1994b; Mori et aL, 1994; 
Nobori et al, 1994). In addition, MTSI mutations segregate with predisposition to melanoma in 
several kindreds known to cany 9p21 linked w»'«n'>ma susceptibility (Hussussian et aL, 1994; 
Kamb et aL, 1994a). However, there are unresolved questions regarding the role of MTSI in, 
hereditary and sporadic cancer. Several melanoma-prone kindreds with high LOD scores for 9p21 
markers do not reveal mutations in MTSI coding sequences. Also, the preponderance of MTSI 
homozygous deletions in tumors and cell lines is atypical for tumor suppressor gene inactivation 
and may imply the presence of another gene(s) near MTSI which also participates in cancer 
formation. 

Recent reports suggest that some autogenic and antimhogenic signals affect cell cycle 
p. egression, at least in part by regulating the activity of CDK inhibitors (Firpo ct aL, 1994; Harmon 
and Beach, 1994; KatoetaL, 1994; Polyak et aU 1994a; Slingerland et aL, 1994). For example, 
TGFpMnduced cell cycle arrest may be mediated by activation ofpl5 and p27. Conversely, p27 
may be negatively regulated during IL-2-induced myogenic activation of quiescent T lymphocytes. 
Comparatively little la known about the regulation of MTSI. Many recent reports provide 
evidence that MTSI levels may be regulated in part by Rb protein (Serrano et aL, 1993; Li et aL, 
1994a; Tarn et aL, 1994b; Parry et aL, 1995). These and other findings (Serrano et aL, 1995) have 
contributed to a model for MTSI action in which MTSI inhibits CDK4/6 and thereby prevents 
phosphorylation of Rb. Rb in turn participates in a feedback loop to limit the levels of MTSI. 

These results provide genetic evidence for a pre-eminent role of MTSI in control of the cell 
cycle. Moreover, the results suggest that the target of MTSI in vivo is & major fector in 
tumori generis. If MTSI inhibits Cdk4 in vivo and not Cdk2, Cdk4 may be a strong-candidate for 
an oncogene. .The prevalence of mutations in the MTSI gene implies that Cdk4 may sewers a 
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general activator of cell division in most, if not all, cells. Further biochemical studies of the effects 
ofMTSl on different Cdks may help clarify the hierarchy ofCdk activity in both normal cells and 
transformed cells. By analogy with pl6, if p21 acts as a general inhibitor of Cdks, its gene may 
also be lost or mutated in a large percentage of tumors. 
5 If MTSi is a general tumor suppressor active in most normal cells, gennline mutations in 

MTS1 might be expected to predispose to cancers other than melanoma. For example, gennline 
mutations in the pS3 gene such as those found in Li-Fraumeni syndrome increase the likelihood of 
many tumor types inc luding childhood sarcomas, breast cancer (Malkin et aL, 1990). Previous 
studies have found an unusually high incidence of pancreatic cancer in some families that are prone 

10 to melanoma (Bergman et aU 1990; Nancarrow et aL, 1993). This observation accords with die 
finding that homozygous deletions ofMTSl occur in pancreatic tumor lines. It is possible that the 
genetics of KfTSl predisposition may be different from the somatic cell genetics ofMTSl. For 
instance, large deletions that remove many kilobases of DNA from die region surrounding MTSI 
may be lost from the fcrniftn gene pool, due to a selective disadvantage. However, such deletions 

15 may be favored in transformed somatic cells, perhaps because they remove multiple genes. This 
possibility is consistent with the existence of a second gene with striking similarity to MTSI, - 
called MTSI MTS2 is located roughly 12 kb upstream of exon 1 ofMTSl, the first exon of 
MTS2 being roughly 2.5 kb upstream of the second exon of MTSI MTS2 may function in a 
fashion similar to MTSI. Pactions that remove both MTSI and MTS2 might confer a greater 

20 growth advantage to cells than mutations that inactivate either gene alone. Alternatively , die two 
different genes may fraction in a non-overlapping or partly overlapping set of cell types. These 
possibilities remain to be thoroughly explored. 

EXAMPLE 10 

25 Mutational Analysis of MTSI El fl 

Both the pr e pon derance of homozygous deletions which inactivate P16 in tumor derived cell 
lines, and the 9p2J -lintel melanoma-prone kindreds that do not reveal mutations in P16 have led 
others to propose the presence of another gene(s) near P16 which is also involved in cancer 
formation (Cairns etel, 1994; Sprucketal, 1994). IfElp encoded a protein which was involved 

30 in regulating cell growth, then these sequences could contain mutations in either sporadic and/or 
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familial cancer that would have been missed in earlier studies. Therefore, El JJ was screened for 
mutations in cell lines derived from various tumors and in some melanoma prone kindreds. 

Genetic characterization of the melanoma-prone pedigrees has been previously reported 
(Cannon-Albright et aL, 1992). Isolation of genomic DNA from melanoma prone kindreds (Kamb 
5 et ah, 1994a) and from cell lines (Liu et aL, 1995) has been previously described. PGR 
amplification for El 0 was performed using the forward primer (5'«AGTCTGCAGTTAAGG-»3 9 
SEQ ID NO:33) and the reverse primer (5'-OGCTAGAGGCGAATTATCTGT-3 , SEQ ID 
NO 34) for 30 cycles using the following conditions: 97°C for 3 seconds, 6S°C far 10 seconds, 
75°C for 20 seconds. The amplification reactions were diluted 100 fold and amplified again under 

10 the same reaction conditions with die same forward primer and die reverse primer (S 9 « 
CACCAAACAAAACAAGTGCCG-3' SEQ IDNOJS). PCR products were run on a 1% agarose 
gel and were extracted using Qiagen beads (Qiagcn, Inc.), The products were sequenced using the 
Cyclist Sequencing kit (Stxatagene) with the forward primer mentioned above (SEQ ID NO:33). 
No sequen ce variants of Eip were detected in a set of 24 cell lines derived from 4 tumor 

15 types (Table 6) or in 6 melanoma kindreds with significant haplotype sharing among affected 
family members (Caanon-Albrigfat et aL, 1992), but which did not reveal P16 mutations in a 
previous study (Kamb et aL, 1994a). These expe ri ments suggest that mutations in Elf) are not a 
common event during tumor p r o g ressi on, nor are they responsible for 9p21 -linked melanoma 



susceptibility in these kindreds. 

20 TART F ft 

Cell Lines Screened for P.I MutatiOM 

Typ e Number 

lung 3 

bladder 7 

25 glioma 9 

melanoma 1 

total 1 24 



'These cell lines were previously shown not to contain homozygous deletions in the P16 region or 
harbor P16 coding sequence mutations (Liu et aL, 1995). Based on previous results (Liu etaL, 
3 0 1995), a similar number and type of cell lines would have contained 4 point mutations injthe pi 6 
coding sequencetConfined to the bladder and melanoma groups. 
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EXAMPLE 11 

Mutation Screening <%f MTS? 
MTS2 mutation seaming ™ BCll '"tea 

The preponderance of homozygous deletions that remove MTS1 in tumor derived cell lines 
may suggest the presence of another gene or genes near MTSI which are also involved in cancer 
formation. If the MTS2 gene woe involved in sporadic cancer, it might contain Mutations in cell 
lines of tumor origin. Therefore, MTS2 coding sequences were screened for mutations in a set of 
tumor cell lines. 

PCR amplification for exons 1 and 2 of MTS2 were performed as described in Kamb et aL 
(1994a). The primer pair 2E1 Jl (5NAGGGAAGAGTGTCGTTAAG-3* SEQ ID NO:19) and 
2E1.R2 (5*-AGACTCCTGTACAAATCTAC-3* SEQ ID NO20) was used to obtain exon 1. 
Primer pair 89F (SEQ ID NO:12) and 50R (SEQ ID NO:ll) was used to obtain exon I After 
amplification, the DNA products were run on a \% agarose gel and were extracted using Qiagen 
beads (Qiagen, Inc.). The products were sequenced "ti ng the Cyclist Sequencing kit (Stratagene) 
with primer 2E1J1 for exon 1 and 89F and SOR for exon 2. 

MTS2 coding sequences were screened for mutations in a set of cell lines derived from 
bladder, glioma, astrocytoma, lung, renal and melanoma tumors. All these cell line types contain 
homozygous deletions of MTS2 and MTS1 at high frequencies (Kamb et aL. 19>4bX Cell lines 
derived from me l a noma, lung, renal, and bladder carcinoma have been shown to contain point or 
frameshift mutations in MTS1 (Liu et aL, 1995). Glioma and astrocytoma cell lines, however, 
have not been shown to contain such MTS2 natations. The particular cell lines used in these 
screening experiments were selected from a group shown previously not to harbor homozygous 
deletions of MTS2 and MTS1 sequences (Kamb et aL, 1994b). 

No MTS2 mutations were found in MTS2 coding sequences in any of the 58 cell lines that 
were screened (see Table 7). Based on previous studies of MTS1 in these cell line types, the set 
would be expected to contain about 8 MTS1 mutations confined to the bladder, melanoma, lung, 
and renal group (Liu etaL, 1995). Thus, no evidence for somatic mutations in MTS2 was obtained 
from this set of tumor cell lines. 
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TABLE7 

Mutation Screening nfMTST. in (VII T inea 



Cell Line Type 

Astrocytoma 

Bladder 

Glioma 

Melanoma 

Renal 

Lung 

Small Cell Lung 
Non-small Cell Lung 
Total 



# Screened 
2 
4 
6 
17 
4 
10 
7 

1 



58 



Polymorphisms 1 
# of Changes Type of Change Coding Effect 



0 

1 

0 

2 

1 

I 

2 



G->A;C-+A 

G-tA:C-»A 
G-*A;C-+A 
G-*A;C-»A 
G-*A;C->A 



None 

None 
None 
None 
None 



1 These common polymorphisms (Kambetai, 1994a) are located in intron 1 near the 3' acceptor 
site at nucleotide positions -27 (C to A) and -1 03 (O to A). 



MTS2 mutation srw»ning in lriwrfwria 

20 The possibility that the MTS2 geoe accounts for the melanoma susceptibility in the 9p21- 

linked, melanoma-prone kindreds that do not have MTS1 coding wnptfncf mutations is attractive. 
Genetic characterization of the melanoma-prone pedigrees has been previously reported (Cannon* 
Albright etaL, 1992). Genomic DNA from family m em b ers was isolated from lymphocytes which 
had been separated from whole blood using standard procedures (Kamb et aL, 1994a). Screening 

25 was performed as described above far mutations in MTS2 coding sequences in 6 kindreds with 
high LOD scores for 9p21 -linked predisposition to melanoma but which did not reveal MTS1 
mutations in a previous study (see Table 8) (Kamb et aL, 1994a). No mutations in MTS2 were 
dete ct ed. These experiments thus provide no evidence that MTS2 lesions contribute to hereditary 
melanoma although such a possibility cannot be ruled out simply based on these limited 

30 experiments. . 
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TABI.fi 8 

Melanoma.pmne Kindreds Screened far MTS2 yermlm* Mutations 

Cases with 



Kindred 


LOD Score 


Total Cass 




3346 


5.97 


21 


21 


3137 


1.9 


17 


21 


1764 


1.04 


4 


4 


3006 


0.19 


6 


3 


3161 


•0.01 


10 


8 


3343 


-0.53 


10 


8 



EYAMW.r.1^ 
Expression of MTS 1 and MTS1 V] ft RNAil 

TwnPlrtpmtnftli^B 

The two different forms of the P16 mRNA could be generated in two possible ways. 
Transcription could initiate from different promoters, or the mRNA could be derived from a single 
promoter and then alternatively spliced to generate the different forms of the transcript 

Evidence for separate o transcript and (J transcript promoters was obtained by demonstrating 
that the a form was transcribed in cell lines even when the upstream Elf) seque nces were deleted. 
Cell lines A375 and SK-mel 93 contain a deletion with one breakpoint between Elo and E10 
(Figure 13). The proximal breakpoint has not been precisely mapped in either cell line, but was at 
least 85 kb upstream of the 5* end of El p. Using RT-PCR with a-specific primers, both of these 
cell lines were shown to express the a transcript (Figure 15). The procedure for the RT-PCR is as 
follows: cDNA was synthesized from total RNA (Sambtook et aL, 1989) isolated from T cells, 
cell lines, or human tissues (Clontech). The cDNA reactions employed random 9 men to prime 
DNA syntheasand Superscript 0 reverse transcriptase (Bethesda Research Laboratories). cDNA 
yields were calculated by including a"P-dATP (Amersham) in the synthesis reaction (0.1 
Ci/mmole) and determining the amount of radioactive nucleotide incorporated into the final 
product P16 a and P16 B transcript levels were analyzed by PCR using a or 0 specific , forward 
primers and heminested reverse primers from E2 in two successive rounds of amplificatio n. In t he 
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initial amplification, 2 ng of cDNA was amplified with the a-specific primer AS.1 (5'- 
CAACGCACCGAATAGTTACG-3' SEQ ID NO;26) or the p-specific primer BS.l (5'- 
TACTGAGGAGCCAGCGTCTA-3 1 SEQ ID NO:27) and X2JU40* (5 f - 
AGCACCACCAGCGTGTC-3 • SEQ ID NO:22)* The reactions were done on a Perkio-Elmer 
5 9600 thermal cyder for 20 cycles under the following conditions: 97°C for 3 seconds; 65*C for 10 
seconds; 7S°C for 20 seconds. These reactions were diluted 100 fold and ^amplified with AS.l or 
BS.l and X2B (S'^GTGTCCAGiGAAGCCC-r SEQ ID N023). The X2B oligo was 
radiolabeled at its 5* end (Sambrook et aL, 1989) with y^P-dATP (DuPont). PGR conditions, were 
as above, but for only 15 cycles. To eliminate problems due to genomic DNA contamination, the 

10 PCR products spanned the Ela or Eip/E2 splice junction. The products were resolved by 
electrophoresis through a denaturing 5% polyacrylamide gel Dried gels were exposed to X- 
OMAT (Kodak) film overnight 

The results suggest that the a transcript initiates from a promoter that is independent of 
sequences 5' of Elp. An alternative explanation is that the deletions fused ectopic promoter 

15 sequences to Ela. However, this seems unlikely given that A375 and SK-mel 93 are 
independently isolated cell lines. The exact location of the a promoter is not clear, but RNase 
protection analysis indicated that h initiated at least 440 bp upstream of the pl6 initiation codon. 
Thus, the human pl6 gene is complex, with two partially overlapping transcripts with distinct 
coding potential, produced from separate promoters, P a and Pp. 

20 Expression pattern of P16 

Clues to the function of genes may emerge from analysis of their expression pattern in 
different t iss u es. To determine the expression pattern of P16, a set of cDNA samples prepared 
from eleven tissues were screened by PCR with a and p specific primers (Figures 16A-D). Both 
forms of P16 transcript were detected in all tissues examined, though there were some differences. 

25 For example, in spleen the ratio between the a and p forms was skewed toward p. In contrast, the 
ratio in breast favored gl These expression data are consistent with studies which found deletions 
and point mutations of PI 6 in cell lines derived from many different tissue types (Kamb et aL, 
1994b; Liu et al. f 1995) in that they suggest roles for pl6 in multiple tissues. 

Given the biochemical function of pi 6, demonstrated in vitro to be an inhibitor of CDK4 and 

30 CDK6 (Serrano et al., 1993; Li et al., 1994a; Parry et aL, 1995), the expression bf PIS was 
analyzed as cells' traversed the cell cycte. Human peripheral blood lymphocytes (PBLs) $S8T r 
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stimulated by phytohemaglutinin (PHA) plus interleukin-2 (IL-2), and cells were harvested at 
different times after stimulation. These cells were analyzed by flow cytometry to determine their 
cell cycle stage, by RT-PCR to detennine the relative levels of P16 gene expression, and by 
Western blot to detennine the levels of pl6 protein. The peripheral blood lymphocytes were 
5 isolated fiom blood drawn fiom normal adult donors and partially purified by floatation on Ficoll- 
Hypaque gradients (Boyum, 1968). The lymphocytes were further purified by counter current 
ehitration as peviously described (Hstad et aL, 1988). These authors estimated that a ceU 
population, prepared* this ii^ The purified cells were grown in 

RPMI (Qibco) supplemented with 10% fetal bovine serum. Quiescent cells were induced by 
10 lOug/ml PHA (Sigma) and 10 U/ml IL-2 (Sigma). Cell cycle progression was monitored by flow 
cytometry. RNA was isolated fiom primary T cells using RNazol B (ONNA/BIOTECX 
Laboratories, Inc.) as described by the manufacturer. The quantitative behavior of the RT-PCR 
was confirmed by creating serial dilutions from the T cell cDNA isolated after induction. The 
amount of target cDNA present in the undiluted sample was quantified by determining the dilution 
15 value at which the target was no longer amplifiable. Although the results from the different PCR 
experiments were in agreement, the dilution experiments suggested that we could only detect 
changes in RNA levels if they were greater than 4 fold. The cDNA samples fiom the Rb* and Rb* 
cell lines were also analyzed in this manner. Human actin was easily detected and present in 
similar amounts fiom each cDNA sample (whether frovi tissues, cell lines, or T cells). 
20 The ratio of the two forms of P16 transcript changed draniatically through the cell cycle 

(Figures 16B-C). Initially, the 6 form was low, but by 30 to 40 hours after stimulation, the level 
began to rise. During this time, the expression level of the a form remained relatively constant, 
perhaps increasing slightly. By flow cytometry, the ratio change was correlated with celkvexiting 
G, and entering S phase. The quantitative behavior of the RT-PCR was examined by template 
25 dilution experiments. Based oh those experiments, RT-PCR was sensitive to four fold or greater 
changes in transcript level. The B induction was estimated to be at least ten fold. Therefore, as T- 
" celis entered the cefcyele they altered the relative amounts of the two forms of the P16 transcript 
so that the ratio changed in the favor of 6. 

We also examined the level of pl6 protein expression as the T-cells traversed the cell cycle. 
30 Protein was isolated fiom the cells at various times after mitogenic induction, and 'the isolated 
protein was subjected to Western analysis. The levels of pl6 wrtein wererietennined usiBj£*jil6. 
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antibody raised against the 20 C-terminal amino acids of the complete polypeptide. As the cells 
exited Go, the level of pl6 protein remained relatively constant Thus, both die pl6-encoding 
RNA (the a transcript) and pl6 protein remained relatively constant during the cell cycle. Others 
have reported a moderate increase in pl6 levels during S phase (Tarn et aL, 1994b). We did not see 
an accumulation of p 1 6, which might reflect differences in pi 6 regulation in different cell types, or 
reflect problems in detecting a two to three fold increase in protein (or cDNA) levels. 
Expression of P16 in tumor cell lines' 

Previous studies have suggested that Rb influences die expression of pl6 (Serrano et aL, 
1993; Li et aL, 1994a; Pany et aL, 1995). We tested the effect of the Rb status of cells on the 
expression of the P mRNA (Figure 16D). cDNA was prepared from a set of cell lines, five of 
which contained wild type Kb protein, and six of which contained non-functional Rb protein (Pany 
et aL, 1995). As expected, a transcript was only detected in Rb-negative lines. However, the 0 
transcript was present in both Rb-poshive and Rb-negadve cell lines. Therefore, in contrast to a, 
expression of the 0 RNA is independent of the mutation state of Rb in tumor-derived cell lines. 

There is evidence that pl6 is a member of a muldgene family (Guan et aL, 1995). By 
analogy with other muldgene families, the members of this family might cany out redundant 
functions, different functions, or function in di ffer e nt temporal or tissue-specific patterns. 
Therefore, given the low level of pl6 protein and apparent lack of P16 regulation by Rb, it is 
possible that P16 does not regulate the cell cycle in T lymphocytes. However, because the 0 
transcript was dramatically induced upon T cell induction, and because P16 is deleted in a high 
percentage of T cell-derived tumors (Hebert et aL, 1994), it seems likely that pl6 carries out an 
important function in human T cells. A dramatic effect of Rb on pl6 has only been observed in 
virally transformed or tumor-derived cell lines. Perhaps PI 6 is regulated in some other manner in 
wild type tissue. 

El 6 is a conserved and regulated part of pi g 

Although the role of the 0 transcript is unclear, the results suggest that it is important for the 
function of the pl6 locus because: (i) Elf} is conserved in mice; Qi) the relative amount of die 0 
transcript is regulated in both a tissue-specific and cell-cycle dependent manner, and (Hi) two cell, 
lines harbor homozygous deletions that remove E10, but not El a These results suggest that £10 
is required for wild-type P16 function. _ mM . 
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Tbe mouse p cDNA was isolated and compared to MTSlEip of humans. Mouse cDNA 
clones were isolated by a modified hybrid selection procedure called hybrid capture RACE (HCR). 
Mouse polyA'-enriched RNA was isolated form breast and thymus tissues. First strand cDNA 
synthesis reactions (Sambrook ct at, 1989) employed random 12 men and Superscript II reverse 
transcriptase (Bethesda Research Laboratories). After second strand synthesis, the cDNAs were 
"anchored" by ligation of a specific double stranded oligo (dsRP.2) (5'- 
TGAGTAG AATTCTAACGGCCGTCATTQTTC-3 ' SEQ ID NO:28) to their 5' ends. The 5' end 
of the second cDNA strand was the only phosphorylated DNA end in the ligation reaction. After 
the ligation the anchored cDNA was purified by fractionation on Sepharose CL4B columns. The 
anchored cDNA was amplified with a P16 specific reverse primer (5' 
AGCGTGTCCAGG AAGCCTTC-3 * SEQ ID NO: 29) and a nested version of RP.2 (RPB) (5*- 
TGAGTAGAATTCTAACGGCCGTCATTO-3' SEQ ID NO:30) followed by capture with a 
biotinylated gene-specific oUgonudeotide (S'-ACTGCGAGGACCCCACTACCTTCTCC-S' SEQ 
ID NO:31) upstream of the reverse primer used in the first amplification. The captured cDNAs 
were amplified again, using RP3 and a gene-specific reverse primer (5'- 
GAACGTTGCCCATCATCATC-3* SEQ ID N032) upstream of the capture oligo. The resultant 
products were gel purified, cloned, and sequenced. The sequence for the mouse P16 
oligonucleotides was determined by cloning and sequencing a mouse genomic done that contained 
yxp ioyiK hybridizing to a human E2 probe at low stringency. 

Comparison of the mouse 0 transcript to the human suggests that Eip does not encode a 
protein. Only the sequence comprising the pl6 reading frame in E2 was rigorously conserved. 
Therefore, if the p transcript were translated, it seems likely that the protein would initiate in E2 
and be translated in the same frame used to encode pl6. The deduced polypeptide would have a 
calculated molecular weight of 10 kDa and retain 2 3/4 of the 4 ankyrin repeats present in pl& 
However, pl5 contains only 3 1/2 ankyrin repeats (Harmon and Beach, 1994), and other proteins 
fold and function with only one or two repeats. Whether a plO molecule exists m vfw and whether 
h inhibits CDK4/6 remain to be tested. 

Function of the ft RNA 

If the role of the p transcript were to inhibit cell growth, we might find mutations which 
disrupt Eip in tumor-derived cell lines. Consistent with this view are two melanoma- cell lines 
with deletions that remove Elp yet continue to express the a transcript Hie pi 6 coding seouence 
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is wild type in these cell lines. Nevertheless, no small genetic lesions in Eip (e.g. base 
substitutions) were found in a set of 25 tumor cell lines. Therefore, it is difficult to conclude dial 
Eip was the target of the homozygous deletions. If the Elf) exon does not encode a protein, small 
genetic lesions may be insufficient to disrupt its function. Alternatively, the target of the deletions 
5 mentioned above might have been some other gene. For example, it is possible that the pi 5 gene 
(MTS2) was the relevant target of the deletions in these melanoma cell lines. In that view, Eip 
was deleted simply because h is closer to P15 than is Ela. However, since we were unable to 
detect PIS point mutations in a variety of cell lines, and because there were no cell lines that 
' contained deletions which specifically removed PIS (Kamb et aL, 1994b), this explanation seems 
10 unlikely. 

The genetic evidence suggests that pl6 and Rb are members of a growth regulatory pathway 
that is often inactivated during tumor progression. If the role of the 0 transcript is to negatively 
regulate cell growth, perhaps it is part of another pathway which must be mutated independently 
from pl6 and Rb. This would explain why deletions which specifically disrupt Eip have only 
15 been seen in Rb" cell lines. Based on its expression pattern, it seems likely that Eip plays a role in 
actively cycling cells. A definitive conclusion on the role of Eip awaits analysis of its expression 
in vivo. 

EXAMPLE 13 

20 E*pt«aton of MTS2 mPNA 

RNA was isolated from cell lines or from primary T cells using RNazol B 
(CINNA/BIOTECX Laboratories, Inc.) as described by the manufacturer. cDN A was synthesized 
from total RNA (Sambrook et aL, 1989) using a random 9 mer to prime DNA synthesis. : cDNA 
yields were calculated by including a n P-dATP (Amersham) in die synthesis reaction (0.1 

25 Ci/mmole) and determining the amount of radioactive nucleotide incorporated into the final 
product MTS2 expression was analyzed by PCR using hrminrstrd reverse primers in two 
successive rounds of amplification. In the initial amplification, 2 ng of cDNA was amplified with 
E1F (5*-TGAGGGTCTGGCCAGC-3' SEQ ID N021) and X2JU40' (5*- 
AGCACCACCAGCGTGTGT SEQ ID NO:22). The reactions were done on a Peridn-Elmer 

30 9600 thermal cycler for 20 cycles under the following conditions: 97°C, 3 seconds,^65°C, 10 
seconds; 75°G,-20 seconds. These reactions were diluted 100 fold and reamplificd-with EUUnd 
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X2B (5*^GTGTCCAGGAAGCCC-3* SEQ ID NO:23). The X2B oligo was radiolabeled at its 5' 
end (Sambrook et aL, 1989) with y"P-dATP (DuPont). PCR conditions were as above, but for 
only IS cycles. The resultant products were resolved by electrophoresis through a denaturing 5% 
polyacrylamide geL Dried gels were exposed to X-OMAT (Kodak) film overnight 

5 gymwrirm in different tissues 

MTS2 was found to be expressed in many tissue types, including those that give rise to 
tumors in which MTS2 is homozygbusly deleted (see Figure 10A). However, there were some 
diffe rences among the tissues. For example, while the MTS2 transcript was easily detected in lung 
t iss u e, it was undetectable in prostrate and brain tissue. In contrast, expression of the closely 
10 related MTS1 gene was detected in all of the tissues examined It is unknown if the tissue specific 
differences in MTS2 RNA levels reflects a tissue specific requirement for the MTS2 protein, 
mtr? aqaeaian throughout the ceil cvde 

If MTS2 regulates important transitions in the cell cycle, its expression might vary through 
the cell cycle. For instance, in normal dividing cells the abundance of p21 mRNA varies as a 

15 function of cell cycle phase (Li et ah, 1994b). To test whether or not MTS2 transcription was 
regulated throu gh the cell cycle, quiescent human T cells were stimulated with PHA and 1L2 and 
monitored at various stages after stimulation (see Figure 10B). No obvious trend in MTS2 
expression level was detected as the cells exited O 0 and passed through the cdl cycle phases. In 
contrast, the expression of t control gene, CDK4, did change as expected (Matsushime et aL, 

20 1992). Thus, no evidence was found for the differential expression of MTS2 mRNA through the 
cycle of normally dividing cells. 

The MTS1 protein has been proposed to participate in a growth regulatory pathway 
involving the retinoblastoma protein Rb (Serrano et aL, 1993; Guan et al., 1995; Serrano et aL, 
1995). Recent work has provided strong circumstantial evidence for the view that expression of 

25 MTS1 is controlled, at least in part, by Rb (Li et aL, 1994a; Pany et aL, 1995). The biochemical 
similarities between MTS1 and MTS2 suggest that MTS2 might also be regulated by Rb. This 
possibility was tested by comparing levels of MTS2 mRNA in Rb positive cell lines and Rb 
negative cell No correlation between Rb status and MTS2 RNA levels was detected (see 
Figure 10C). This suggests that the Rb status of the cell line does not dramatically affect the 

30 abundance of MTS2 transcript Thus, in contrast to MTS1, MTS2 expression may be independent 
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EXAMPLE14 

frtnpie EYpressinn of MTS1 and MTS2 
A 483 bp fragment of MTS1 was generated by a polymerase chain reaction using primers 
5 MTS1J (5 f AAA GGA TCC ATT GCC ACC ATG QAO CCO OCO GCG GGG AGC AGC 
ATG GAG CCT TCG GCT 3') (SEQ ID NO:17) and E3«R (5 f TTT GAA TTC AAT CGG GGA 
TGT CTG 3') (SEQ ID NO:18). Primer MTSLF was designed to include restriction enzyme sites 
near the 3 ' end for future cloning and a Kozak consensus sequence (Kozak, 1 987). Template DN A 
for this reaction was cDNA from breast tissue. This generated fragment was inserted into the 

10 expression vector pcDNA3 (In Vitrogen) which had been digested with EcoRI and BamHL 
' pcDNA3 contains a cytomegalovirus (CMV) promoter and codes for resistance to ampicillin and 
neomycin. The resulting recombinant vector, pcDNApl6, was then inserted by electroporation 
into cell line HS294T. HS294T is derived from a melanoma and contains a homozygous deletion 
of both MTS1 and MTS1 HS294T was grown in DMEM (Gibco) supplemented with 10% fetal 

IS bovine serum, non-essential amino acids, sodium pyruvate, and L-glutamate. The cells were 
grown at 37°C in 5% CO* HS294T was cotransfonned with a 1:4 ratio of pSS (Stratagene), which 
confers hygromycin resistance to transformed cells, and either pcDNA3 (Invitrogen) expression 
vector containing the MTS1 coding sequence inserted downstream of the CMV promoter, or the 
pcDN A3 vector without an insert 

20 The coding portion of MTS2 was similarly cloned into pcDNA3, again forming a Kozak 

sequence, to yield pcDNAplS and was inserted into HS294T. For this, MTS2 cDNA as prepared 
in Example 13 above was used 

Plasmid pSS (Stratagene) which contains the selectable marker for hygromycin resistance 
was simultaneously cotransferred with the pcDNApl6 or pcDNAplS, the pSS being present in the 

2 5 clectroporation at 20 ^g per cuvette. The conditions for electropomtion were 800 nL of cells per 
cuvette at 1.5 x 10* cells/ml and 500 jiF capacitance, 400 volts. Control experiments were 
performed using pcDNA3 plus pSS; The elcctroporated cells (about 400 yL) were placed in petri 
dishes with 300 ng/mL hygromycin. The number of colonies (foci) were counted after 14 days. 
The results are shown in Table 9. 
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TABLE 9 

pcDNApi5 pcDNAp!6 
Mamidi Colonies/plate Cnlonic^plato 

pcDNA3 + pSS 26.6±4.8 175±0.15 

5 pcDNApl5 + pSS 3.810.8 — 

pcDNApl6 + pSS — t.l±0.5 

When a construct containing the entire MTS2 coding sequence fused to the CMV promoter 
was transformed into HS294T, it inhibited colony formation by a factor of seven when compared 

10 to controls, comparable to the effect of ectopic expression of MTS1. This result indicates that 
ectopic expression of MTS2 is sufficient to inhibit cell growth. It is not clear whether the 
transformed cells are arrested in Gl, as seems to result from ectopic expression of MTS1, or 
growth is arre s te d in some other manner. It can be concluded from the data of Table 9 that 
overexpression of pi 5 or pl6 in a cell line which otherwise would lack plS or pl6 expression (here 

15 because of a homozygous deletion) inhibits growth of the cells. The precise mechanism is unclear 
but possibilities are that the plS or pl6 o verexpressi on stops cell division or kills the cell These 
results suggest that plS and pl6 function in vfvo as bona fide tumor suppressor proteins and 
therefore plS and pl6 possibly have therapeutic uses. 

For many reasons MTS2 is an attractive candidate for a tumor suppressor gene. It 

20 possesses extensive sequence similarity to MTS1, it binds to and inhibits CDK function in vitro, 
and ectopic expression of MTS2 inhibits cell growth in vivo. The above results raise die 
possibility that despite the biochemical similarity between MTS2 and MTS1, the two proteins have 
significantly different functions in vivo. Two features of MTS2 suggest that this may be so: i) 
MTS2, not MTS1, is induced by TGFp (Harmon and Beach, 1994) and ii) unlike MTS1, MTS2 

25 transcription appears to be independent of Rb. It is possible that MTS2 is not involved in 
tumorigenesis at all Alternatively, MTS2 may participate in a pathway of tumor suppression 
distinct from the pathway involving MTS1. The elements of this pathway are not known, but it is 
conceivable that some of these elements may mutate at much higher frequencies than MTS2 in 
somatic tissue. In this view, the lack of somatic mutation of MTS2 in no way precludes an 

30 important role in tumor suppression. As noted above, ectopic expression of MTS2* inhibits cell 
growth, a role - consistent with MTS2 being a tumor suppressor. The constant level o£J^£ES2 
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expression during the ceU cycle, and its induction by TGFp, suggest a role for MTS2 in Gl arrest 
and not necessarily in regulating the tuning of events in the cell cycle itself In contrast, the 
regulation of MTS1 expression by Rb indicates that MTS1 may have a role in a cell cycle 
oscillator. It will be important to test the function of MTS2 as a growth control molecule in vivo, 
5 and to dissect the pathway(s) within which MTS2 functions. 

' EXAMPLE 13 
TW Stqi Assay «" TVtecttlift IWiw of MTS in a Sample 

10 Patient sample is processed according to the method disclosed by Autonarakis, et al. 

(1985), separated through a 1% agarose gel and transferred to nylon membrane for Southern blot 
analysis. Membranes are UV cross linked at ISO mJ using a GS Gene Linker (Bio-Rad). MTS 
probe corresponding to nucleotide positions 448-498 ofSEQ ID NO:4 is subcloned into pTZ18U. 
The phagemids are transformed into £ coli MV1190 infected with M13K07 helper phage (Bio- 

15 Rad, Richmond, CA). Single stranded DNA is isolated according to standard procedures (see 
Sambrook, et aL, 1989). 

Blots are prebybridized for 15-30 mm. at 6S°C in 7% sodium dodecyl sulfate (SDS) in 0.S 
M NaP0 4 . The methods follow those described by Nguyen, etaL, 1992. The blots are hybridized 
overnight at 65°C in 7% SDS, 0.5M NaP0 4 with 25-50 ng/ml single stranded probe DNA. Post- 

20 hybridization washes consist of two 30 min washes in 5% SDS, 40mM NaP0 4 at'65°C, followed 
by two 30-min washes in 1% SDS, 40mM NaP0 4 at 65°C 

Next the blots are rinsed with phosphate buffered saline (pH 6.8) for 5 min at room 
temperature and incubated with 02% casein in PBS for 30-60 min. at room temperature and rinsed 
in PBS for 5 min. The blots are then preincubated for 5-10 minutes in a shaking water bath at 

25 45°C with hybridization buffer consisting of 6M urea, 03 M Nad, and 5X Denhardfs solution 
(see Sambrook, et al., 1989). The buffer is removed and replaced with 50-75 ul/cm 2 fresh 
hybridization buffer plus 2.5 nM of the covalently cross-linked oUgonucleotide-alkaline 
phosphatase conjugate with the nucleotide sequence complementary to the universal primer site 
(UP-AP, Bio-Rad). The blots are hybridized for 20-30 min at 45°C and post hybridization washes 

30 are incubated at 45°C as two 10 min washes in 6M urea, lx standard saline citrate (SSQ, 0.1% 



WO 95/25813 



PCTAJS95/03S37 



-77- 

SDS and one 10 min wash in lx SSC, 0. 1% Triton*X.100. The olots are rinsed for 10 min. at room 
temp, with IX SSC- 

Blots are incubated for 10 min at room temperature with shaking in the substrate buffer 
consisting of 0.1 M dietoaw>lamine, 1 mM MgClj, 0.02% sodium aride, pH 10.0. Individual blots 
are placed in heat scalable bags with substrate buffer and 02 mM AMPPD (3<2'« 
spiro adammtaneH nnethoxy^ disodium salt, Bto-Rad). 

After a 20 min, incubation at room temperature with shaking, the excess AMPPD solution is 
removed. The blot is exposed to X-ray film overnight Positive bands indicate the presence of 
MIS. 

FYAMPTJE 16 
r-mwatinn of Pnlyelftiml Antibnriv against MTS 

Segments of MTS coding sequence were expressed as fusion protein in £. colt. The 
ova expressed protein was purified by gel elution and used to immunize rabbits and mice using a 
procedure similar to the one described by Harlow and 1^ 1988. Tliis procedure has been shown • 
to generate Abs against various other proteins (for example, see Kraemer, ex ai, 1993). 

Briefly, a stretch of MTS coding sequence was cloned as a fusion protein in plasmid 
PET5A (Novagen, Inc., Madison, WI). The MTS incorporated sequence includes the amino acids 
corresponding to 448-498 of SEQ ID NO:4. After induction with IPTO, the overexpression of a 
fusion protein with the expected molecular weight was verified by SDS/PAGE. Fusion protein 
was purified from the gel by electroelution. The identification of the protein, as the MTS fusion 
product was verified by protein sequencing at the N-tenninus. Next, the purified protein was used 
as immunogen in rabohs. Rabbits were immunized with 100 ug of the protein in complete 
Freund's adjuvant and boosted twice in 3 week intervals, first with 100 ug of immunogen in 
incomplete Freund's adjuvant followed by 100 ug of immunogen in PBS. Antibody containing 
serum is collected two weeks thereafter. 

This procedure is repeated to generate antibodies against the mutant forms of the MTS 
gene. These antibodies, in conjunction with antibodies to wild type MTS, are used to detect the 
presence and the relative level of the mutant forms in various tissues and biological fluids^ 
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EVAMPrF 1 7 
Tfcngratifln of Monoclonal Antibodies Specific for MTS 

Monoclonal antibodies are generated according to the following protocol Mice axe 
5 immunized with immunogen comprising intact MTS or MTS peptides (wild type or mutant) 
conjugated to keyhole limpet hemocyanin using glutaxaldehyde or EDC as is well known. 

The immunogen is mixed with an adjuvant Each mouse receives four injections of 10 to 
100 fig of immunogen and after the fourth injection blood samples are taken from the mice to 
determine if the serum contains antibody to the immunogen. Serum titer is determined by ELISA 
10 or RIA- Mice with sera indicating the presence of antibody to the immunogen are selected for 
hybridoma production. 

Spleens are removed from immune mice and a single cell suspension is prepared (see 
Harlow and Lane, 1988). Cell fusions are performed essentially as described by Kohler and 
Milstein, 1975. Briefly, P3.65 J myeloma cells (American Type Culture Collection, Rockville, 

15 MD) are fused with immune spleen cells using polyethylene glycol as described by Harlow and 
Lane, 1988. Cells are plated at a density of 2x10 s cells/well in 96 well tissue culture plates. 
Individual wells are examined for growth and the supematants of wells with growth are tested for 
the presence of MTS specific antibodies by ELISA or RIA using wild type or mutant MTS target 
protein. Cells in positive wells are expanded and subcloned to establish and confirm 

20 monoclonality. 

Clones with the desired specificities are expanded and grown as ascites in mice or in a 
hollow fiber system to produce sufficient quantities of antibody for characterization and assay 
development 

25 EXAMPLE 18 

Sandwich Assay for MTS 

Monoclonal antibody is attached to a solid surface such as a plate, tube, bead, or particle. 
Preferably, the antibody is attached to the well surface of a 96-well ELISA plate. 100 jil sample 
3 0 (e.g., serum, urine, tissue cytosol) containing the MTS peptide/protein (wild-type or-mutants) is 
added to the solid phase antibody. The sample is incubated for 2 hrs at room temperature. ^^Not 
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the sample fluid is decanted, and the solid phase is washed with buffer to remove unbound 
material. 100 ul of a second monoclonal antibody (to a different determinant on the MTS 
peptide/protein) is added to the solid phase. This antibody is labeled with a detector molecule (e.g. 
125-1, enzyme, fluorophoie, or a chromophobe) and the soUd phase with the second andbody is 
incubaredfartwofasatroomter^ The second antibody is decanted and the solid phase is 
washed with buffer to remove unbound material. 

The amount of bound label, which is proportional to the amount of MTS peptide/protein 
present in the sample, is quantitated. Separate assays are performed using monoclonal antibodies 
which are specific for the wild-type MTS as well as monoclonal antibodies specific tor each of the 
mutations 

identified in MTS. 

It will be appreciated that the methods and compositions of the instant invention can be 
incorporated in the form of a variety of embodiments, only a few of which are disclosed herein. It 
will be apparent to the artisan that other embodiments exist and do not depart from the spirit of the 
invention. Thus, the described embodiments are illustrative and should not be construed as 
restrictive 
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(2) INFORMATION FOR SEQ ID MOtlx 

(1) S EQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 baee pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS : double 

(0) TOPOLOGY: linear 
(ii) MOLBCUUB TYPE: CDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM i Homo eapiena 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .447 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOil: 

j 

ATO OAS CCT TCO OCT GAC TOO CIO CCC ACQ OCC OCG GCC CCO COT COO 48 
Mac Olu Pro Ser Ala Asp Trp Leu Ala Thr Ala Ala Ala Arg Oly Arg 
1 5 10 IS 

GTAOAQOAOOTOCOOOCOCrOCTOOAOOCOOTOOCOCTOCCCAACOCA 96 
Val Olu Olu Val Arg Ala leu Lau Olu Ala val Ala Leu Pro Asa Ala 
20 25 30 

CCO AAT AOT TAC GOT COO AOO CCO ATC CAO OTC ATO ATO ATO OQC AOC 144 
pro Aan Ser Tyr Oly Arg Arg Pro He Oln Val Met Met Met oly Ser 

3S 40 48 

OCX^OTOOCOCWOCTOCTOCXOCTCCACGaCOCOaAOCCCAACTOC i»a 
Ala Arg Val Ala Olu Leu Leu Leu Leu Hie Oly Ala Olu Pro Aan Cye 
50 SS « 

OCC QAC CCC GCC ACT CTC ACC COA CCC OTO CAC GAC OCT OCC CGO OAO 240 
Ala Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp Ala Ala Arg Olu 
65 70 75 80 
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GGC TTC CTG QAC ACG CTO GTG GTO CTQ CAC CGG GCC GGG GCG CGG CTG 268 
Gly Phe Leu Asp Thr Leu Val Val Leu His Arg Ala Gly Ala Arg Leu 
85 90 95 

GAC GTG CGC GAT GCC TGG GGC CGT CTG CCC GTG GAC CTG GCT GAG GAG 336 
Asp Val Arg Asp Ala Trp Gly Arg Leu Pro Val Asp Leu Ala Glu Glu 
100 105 110 

CTO GGC CAT CGC GAT GTC CCA CGG TAC CTQ CGC GCG GCT GCG GGG GGC 384 

Leu Gly His Arg Asp Val Ala Arg Tyr Leu Arg Ala Ala Ala Gly Gly 
115 120 125 

ACC AflA GGC ACT AAC CAT GCC CGC ATA GAT GCC GCG GAA GGT CCC TCA 432 
Thr Arg Gly Ser Asn His Ala Arg He Asp Ala Ala Glu Gly Pro Ser 
130 135 140 

GAC ATC CCC GAT TGA 447 

Asp He Pro Asp * 

145 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Glu Pro Ser Ala Asp Trp Leu Ala Thr Ala Ala Ala Arg Gly Arg 
1 5 10 15 * 

Val Glu Glu Val Arg Ala Leu Leu Glu Ala Val Ala Leu Pro Asn Ala 
20 25 30 

Pro Asn Ser Tyr Gly Arg Arg Pro He Gin Val Met Met Met Gly Ser 
35 40 45 

Ala Arg Val Ala Glu Leu Leu Leu Leu Els Gly Ala Glu Pro Asn Cys 
50 55 40 

Ala Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp Ala Ala Arg Glu 
65 70 75 80 

Gly Phe Leu Asp Thr Leu Val Val Leu His Arg Ala Gly Ala Arg Leu 
85 90 95 

Asp Val Arg Asp Ala Trp Gly Arg Leu Pro val Asp Leu Ala Glu Glu 
100 105 HO 

Leu Gly His Arg Asp Val Ala Arg Tyr Leu Arg Ala Ala Ala Gly Gly 
115 - 120 125 
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Thr Arg Gly Ser Asn His Ala Arg lie Asp Ala Ala Glu Gly Pro ser 
130 13S 140 

Asp lie Pro Asp * 
14S 

(2) INFORMATION PGR SBQ ID HO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1149 base pairs 

(B) TYPBx nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DBA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE x NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 1..890 

(XX) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 891.. 1016 

(ix) FEATURE: , 

(A) NAME/KEY: intron 

(B) LOCATION: 1017.. 1149 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 3: 

TCCCCCGCCC GTKTTAAKTA AACCTCATCT TTCCAGAGTC TGTTCTZATA CCAGGAAATG SO 

TACACGTCTG AGAAACCCR GCCCCAGACA GTCGTTTTAC ACGCAGGAOO GGAAGGGGAG 120. 

GGGAAGGAGA GAGCAGTCCT TTTCTCCAAA AGGAATCCTT NGAACTAGGG TTTCTGACTT ISO 

AGTGAACCCC GCGYTCCTGA AAATCAWOGG TTGAGGGGGT AGGGGGACAC TTYCCTAGTC 240 

GYACAGSTRA TTTCQfTYCT CGGTGGGGCT CTCACAMCTA GQAAAGAATW GTTTTOCTTT 300 

TTC TTA TQAT TAAAAGAAGA AGCCATACTT TTCCCTATQA CACCAAACAC CCCGATTCAA 3S0 

TTTGGCAGTT AGGAAGGTTG TATCGCGGAO GAAGGAAACG GGGCGGGGGC GGATTTCTTT 420 

TTTAACAGAG TGAAOGCACT CAAACACOCC TTTGCTGGCA GGCGGOGGGA GCGCGGCTGG 480 

GAGCAGGGGA GGCCGGAGGG CGGTGTGGGG GGCAGGTGGG GAGGAGCCCA GTCCTCCTTC 540 
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CTTGCCAACG CTGGCTCTGG CGAGGGCTGC TTYCGGCTGG TGCCCCCGGG GGAGACCCAA 600 



CCTGGGGCGA CTTCAGGGGT GCCACATTCG CTAAGTGCTC GGAGTXAATA GCACCTCCTC 660 

CGAGCACTCG CTCACAGCGT CCCCTTGCCT GGAAAGATAC CGCGGTCCCT CCAGAGGATT 720 

TGAGGOACAG GGTCGGAGGQ GGCTCTTCCO CCAGCACCGG AGGAAGAAAO AGGAGGGCCT 780 

GOCTGGTCAC CAGAGGGTGG GGCGGACCGC GTGCGCICGO CGGCTGCGOA GAGGGGGAGA 640 

GCAGGCAGCG GGCGGOGGGG AGCAOCATGG AGCCGGCGGC GGGGAOCA0C ATGOAGCCTT 900 

GGGCTGACTQ GCTGGCCACG GCCGCGGCCC GGGGTOGGGT AGAGGA0GTO CGGGCGCTGC 960 

TGGAGGOGGT GGCGCTGCCC AACGCACCGA ATAGTTACGG TCGOAGGCCO ATCCAGGTGG 1020 ■ 

GTAGAGGGTC TCCAGCGGGA GCAGGGGATG GCGGGCGACT CTGGAGGACO AAGTTTGCAO 1080 

GGGAATTGGA ATCAGGTAGC GCTTCGATTC TCCGGAAAAA GGGGAGGCTT CCTGGGGAOT 1140 

TTTCAQAAC 1149 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

Uv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

fix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1..191 

(ix) FEATURE: 

(A) NAME/KEY: axon 

(B) LOCATION: 192 ..498 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 499.. 1187 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GAATTCATTG TGTACTGAAG AATGGATAGA CAACTCAAGS AGGAAATTGG AAACTGGAAG 60 
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CAAATGTAGG GGTAATTAGA CACCTGGGGC TTGTGTGGGG OTCTGCTTGG CGGTGAGGGG 
GCTCTACACA ASCITCCTTT CCGTCATGCC OGCCCCCACC CTGGCTCTGA CCATTCTGTT 
CTCTCTGGCA GGTCATGATG ATGGGCAOCG CCCGAOTGGC GGAGCTGCTG CTGCTCCACO 
GCGCOOAGCC CAACTGCOCC GACCCCGCCA CTCTCACCCO ACCCGTGCAC GACGCTOCCC 
GGOAGGGCTT CCTGGACACG CTQGTGGTGC TGCACCGGOC CGGGGCGCGG CTOGACGTOC 
OCGATOCCIO GGOCCGTCM CaXWOGACC TGGCTGAGGA CCTGGGCCAT CGCGATOTCG 
CACGOTACCT GCGCGCGGCT GCGCOGGGCA CCAOAGOCM TAACCATOCC CGCMAGATQ 
CCGCGGAAGG TCCCTCAGGT GAGGACTGAT OATCTGAGAA TTTOTACYCT GAGAOCTTCC 
AAAGCTCAGA GCATTCATTT TCCAGCACAO AAAGTTCAGC CCGGGAGACC AGTCTCC6GT 
CTTGCGCTCA GCTCACGCOC CAATOCGGTO GGACOGCCTO AGTCTCCCTA TGCGCCCTOC 
CSCOCACAOC GCGOCAAATO GGAAAJAATC COSAAAIGOA CTTGCGCACQ TGAAAGCCCA 
TTTTGTACGT TATACTTCCC AAAGCAIACC ACCACCCAAA CACCTACCCT CTGCTAGTTC 
AAGGCCTAOA CTGCGGAGCA ATGAAGACTC AAOAGGCTAG AGGTCXW1TO CCCCCTCTTC 
CTCCAAACTA GGGCCAGTTG CATCSACTTA CCAGGTCTGT TTCCTCATTT GCATAOCAAG 
CTGGCIGGAC CAACCTCAGG ATTTCCAAAC CCAATTGTGC GTGGCATCAT CTGOAflATCT 
CTCGATCTCG GCTCTTCTGC ACAACTCAAC TAATCTGACC CTCCTCAflCT AATCTGACCC 
TCCOCTTTAT CCGGTAGAQT TTTCCAOAOC TGCCCCAOGG GGTTCXGOOO ACATCAGGAC 
CAAGACTTCQ CTGACCCTGG CAOTCTGTGC ACCGGAGTTG GCTCCXTTCC CTCTTAAACT 
• TGTGCAAQAG ATCCCTATAG TGAGTCGTAI TATNCCOCCO CGAATTC 
(2) INFORMATION POR SBQ ID HO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 but pairs 

(B) TOTS: nucleic acid 

(C) STRAWDHDHBSS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 

(ill) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1187 
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(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1..273 

(ix) FEATURE, 

(A) NAME /KEY: miec_RNA 

(B) LOCATION: 274.?529 

(D) OTHER INFORMATION t /note* "Correspond* to »r«?n of SBQ 

ID NO: 4" 



(xi) SEQUENCE DESCRIPTION : SSQ ZD NO: 5: 

GATCATCACT TTACCATCAA C T TTC TT GTC TCTGAACOTT TAOAOAATAA AATGGCATTT 60 

AATTGGTVCT OAOTWTAACC TGAAGCTGOG CT G GGAAACT GOWTTGCATC AQCAADTOAA 120 

GAAACACCAG ACATCAQAOA CCTGAACACC TCTGCACTGO GTQAAAACTT GOCAATXAOG 100 

TOTTTCTTTA AATOCCTCCA CCTGCCTTGC CCCOOCCOOC ATCTCCCATA CCTOCCCCCA 240 

CCCTGGCTCT GACCACTCT6 CTCTCTCTGG CAGGTCATGA TGAT6G6CA0 COCCCGCOTO 300* 

GCGGAGCTGC TGCTGCTCCA CGGCGCGGAG CCCAACTGCO CAOACCCTOC CACTCTCACC 360 

CGACCGGTGC ATGATGCTGC CCGGOAGOOC TTCCTGQACA CGCT G OT G OT QCTOCACCOO 420 

GCCGGGGCGC GGCT6GACGT GCGCGAXGCC TQGCCTCGTC TGCCCOTGOA CTTGGCCGAG 480 

OAOCGGGGCC ACCGCOACOT TGCAGGGTAC CTGCGCACAO CCACGGGGGA CTQACGCCAO 540 

GTTCCCCAGC CGCCCACAAC GACTTTATTT TCTTACCCAA TTTCCCACCC CCACCCACCT 600 

AATTCGATGA AGGCTGCCAA CGGGGAGCGO CGGAAAGCCi GTAAOCCTGC AAOCCTOTCT 660 

GAGACTCACA GGAAGGAGGA GCCOACCGGO AATAACCTTC CATACATRT TTTCTTTOTC 720 

TTATCTGGCC CTCGACACTC ACCATGAAGC GAAACACAGA GAAGCGGATT TCCAGGGATA 780 

TTTAGGAGTQ TOTGACATTC CAGOGGTCGT PPg N TT TTC A GGOTTTTCTG AGGGAAAGTO 640 

CATATGAAAT CCTTOACTGO ACCTGGTGGC TACGAATCTT CCCOATGOAT GAATCTCCCA 900 

CTCCAGCGCT GAOTGGOAGA AGGCAGTGAT TAGCACTTOO GTGACGGCAG TCOATGCGTT 960 

CACTCCAATG TCTGCTOAGO AGTTATGGTG AACCCACAAC TTAGGCCCTA GCGGCAGAAA 1020 

GGAAAACCTO AAOACTGAGG ACAAAGTGGA GGAGGGCCGA GGTGGGCTTC AOTATGTCCC 1080 

CNNCGCCGCT TTAGTTTGAG CGCATGGCAA GTCACATGCG TAAACGACAC TCTCTGGAAG 1140 

CCCTGGAGAC CCTCGCCCAA CTCCACCAGA TAGCAGAGGG GTAAOAGAGG ATGTGCAAGC 1200 

GACGACAGAT GCTAAAATCC CTGGATCACG ACGCTGCAGA GCAC 1244 
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(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base. pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL t NO 

(iv) ANTI- SENSE : NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CAGCACCGGA CQAAGAAAQ 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCGCTACCTQ ATTCGAARC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xl) SEQUENCE DESCRIPTION: SBQ ID NO: 8: 
GGAAATTGGA AACTGOAAGC 
(2) INFORMATION FOR SSQ H> NO:9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 baae paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

' ■ . (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bono sapiens 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO:9: 
TCTGAGCTTT GGAAGCTCT 
(2) INFORMATION FOR*SEQ ZD HO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 baae pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL i NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION : SBQ ID NO: 10 
GATCATCACT TTACCATCAA C 
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(2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE* nucleic acid 

(C) STRANDRDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL t NO 

<iv) ANTI-SENSE t YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTIONS SEQ ZD NO: 11: 
GGGTGGGAAA TTGGGTAAG • 
(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) 6TRANDSDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vl) ORIGINAL SOURCE x 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGAGTTTAAC CTQAAGGTGG 
(2) INFORMATION FOR SEQ ID NO 1 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDRA 
(ill) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE i 

(A) NAME/KEY: CDS 

(B) LOCATION: 338.. 655 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

COCOCCTOCO GGGCGGAGAT GGGCAGGGGG CGGTOCQTGG OTCCCAGTCT GCAGTTAAGG 60 

OGGCAGGACT GGC6CT6CTC ACCTCTGQTO CCAAAGGGCO GCGCAOCGOC TOCCGAOCTC 120 

GGCCCTGGAO GCGGCGAGAA CATSGTGC6C AG6TTCATG6 TGACCCTCCG GATTCOGCGC 180 

GCGTGCGGAC CGCCGCGAGT GAGGOTTTTC OTGGTTCACA TCCCGCGGCT CACGGGGGAG 240 

TGGGCAGCAC CAGGGGCGCC CGCCGCTCTG GCCCTCGTGC TGATGCTACT GAGGAGCCAG 300 

CGTCTAGGGC AGCAGCCGCT TCCTAGAAGA CCAGGTC ATG ATG ATG GGC AGC GCC 355 

Met Met Met Gly Ser Ala 
150 155 

CGA GTG GCG GAG CTO CTG CTG CTC CAC GGC GCG GAG CCC AAC TGC GCC 403 
Arg Val Ala Glu Leu Leu Leu Leu His Gly Ala Glu Pro Asn Cys Ala 
160 165 170 

GAC CCC GCC ACT CTC ACC CGA CCC GTG CAC GAC GCT GCC CGQ GAG GGC 451 
Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp Ala Ala Arg Glu Gly 
175 180 185 



TTC CTG GAC ACG CTG GTG GTG CTG CAC CGG GCC GGG GCG CGG CTG GAC 499 
Phe Leu Asp Thr Leu Val Val Leu His Arg Ala Gly Ala Arg Leu Asp 
ISO 195 200 

GTG CGC GAT GCC TGG GGC COT CTG CCC GTG GAC CTG GCT GAG GAG CTO 547 
Val Arg Asp Ala Tzp Gly Arg Leu Pro Val Asp Leu Ala Glu Glu Leu 
205 210 215 

GGC CAT CGC GAT GTC GCA CGG TAC CTG CGC GCG GCT GCG GGG GGC ACC 595 
Gly His Arg Asp Val Ala Arg Tyr Leu Arg Ala Ala Ala Gly Gly Thr 
220 225 230 235 

AGA GGC ACT AAC CAT GCC CGC ATA GAT GCC GCG GAA OCT CCC TCA GAC 643 
Arg Gly Ser Asn His Ala Arg He Asp Ala Ala Glu Gly Pro Ser Asp 
240 245 250 

ATC CCC GAT TGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAAACTTAGA 695 
He Pro Asp * 
255 

TCATCAGTCA CCGAAGGTCC TACAGGGCCA CAACTGCCCC" CGCCACAACC CACCCCGCTT 755 
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TCGTAGTTTT CATTTAGAAA ATAGAGCTTT TAAAAATGTC CTOCCTTTTA ACGTAGATAT 815 

AAGCCTTCCC CCACTACCGT AAATQTCCAT TTATATCATT T3TTAXATAT TCTTAXAAAA 875 

ATGTAAAAAA GAAAAACACC GCTTCTGCCT TTTCACTGTG TTGOAOTTTT CTOOAflTOAO 935 

CACTCACGCC CTAAGCGCAC ATTCATOTOO OCATTTCTTO COAOCCTCGC AflCCTCCOOA 995 

AGCTGTCGAC TTCATGACAA GCATTTT67Q AACTA60GAA OCTCAGGGGG GTTACTGGCT 1055 

TCTCT T Q AOT CACACTGCTA GCAAATGGCA GAACCAAAOC TCAAATAAAA ATAAAATTAT 1115 
TTTCATTCAT TCACTC 

(2) INFORMATION FOR SBQ ID K0:14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 105 amino acids 

(B) TYPB: amino acid 
<D) TOPOLOGY: linaar 

(ii) MOLECULE TOT: protein 

<xi) SEQUENCE DESCRIPTION: SBQ ID HO 1 14 1 

Met Met Met Qly ser Ala Arg Val Ala Olu Leu Leu Leu Leu Hia Oly 
! S 10 15 

Ala Olu Pro Asn Cys Ala Asp Pro Ala Thr Leu Thr Arg Pro Val Hia 
20 35 30 

Asp Ala Ala Arg Glu Oly Phe Leu Asp Thr L#a val Val Leu His Arg 
35 40 4S 

Ala Gly Ala Arg Leu Asp Val Arg Asp Ala Trp oly Arg Leu Pro Val 
50 55 60 

Asp Leu Ala olu olu Leu oly His Arg Asp Val Ala Arg Tyr Leu Arg 
€5 70 75 80 

Ala Ala Ala Oly Oly Thr Arg Qly ser Asn His Ala Arg lie Asp Ala 
85 50 95 

Ala Olu Oly Pro Ser Asp He Pro Asp ♦ 
100 10S 

(2) INFORMATION FOR SBQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSt double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPB: cDNA 
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(iii) HYPOTHETICAL: KO 

(iV) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAKB/XEY: CDS 

(B) LOCATION : 335. -751 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGGGCAGTGA GGACTCCGCG ACGC G TCCOC ACCCTGOGGC CAGAGCGGCT TTGA0CTCGG €0 

CTGCGTCCGC GCTAGGOGCT TTTTCCCAGA AGCAATCCAG GCGCGCCCGC TGGTTCTTGA 120 

GCGCCAGGAA AAGCCCOGAG CTAACGACCO GCCGCTCGGC CACTGCACGG GGCCCCAAGC 180 . 
CGCAGAAGGA CGACGGGAGG GTAATGAAGC TGAGCCCAGG TCTCCTAGGA AGGAGAOAGT . 240 

GCGCCOOAGC AGCGTGGGAA AGAAGGGAAG AGTGTCGTTA AGTTTACGGC CAACGGTGGA 300 

TTATCCOGGC CGCTGCGCOT CTGGGGGCTG CGGA ATG CGC GAG GAG AAC AAO 352 

Met Arg Glu Glu Asn Lys 
HO 

GGC ATG CCC ACT GGG GGC GGC AGC GAT GAG GGT CTG GCC AGC GCC GOG - 400 
Gly Met Pro Ser Gly Gly Gly Ser Asp Glu Gly Leu Ale Ser Ale Ale 
115 120 125 

GOG CGG GGA CTA GTG GAG AAO GTG CGA CAO CTC CTG OAA GCC GGC GCC 448 
Ale Arg Gly Leu Vol Olu Lys Vel Arg Gin Leu Leu Glu Ale Gly Ale 
130 135 140 

GAT CCC AAC GGA GTC AAC COT TTC GGG AGO CGC GCG ATC CAO GTC ATG 498 
Asp Pro Asn Gly Vel Asn Arg Phe Gly Arg Arg Ale lie Gin Vel Net 
145 ISO 1S5 ICO 

ATG ATG GGC AGC GCC CGC GTG GOG GAG CTG CTG CTG CTC CAC GGC GOG 544 
Met Met Gly Ser Ale Arg Vel Ale Glu Leu Leu Leu Leu His Gly Ale 
165 170 175 

GAG CCC AAC TGC CCA GAC CCT GCC ACT CTC ACC CGA COG GTG CAT OAT 592 
Glu Pro Asn Cys Ale Asp Pro Ale Thr Leu Thr Arg Pro Vel His Asp 
180 185 190 

GCT GCC CGG GAG GGC TTC CTG GAC ACQ CTG GTG GTG CTG CAC CGG GCC 840 
Ale Ale Arg Glu Gly Phe Leu Asp Thr Leu Vel Vel Leu Bis Arg Ala 
195 200 205 

GGG GCG CGG CTG GAC GTG CGC GAT GCC TGG GGT CGT CTG CCC GTG GAC 688* 
Oly Ale Arg Leu Asp Val Arg Asp Ala Trp Gly Arg Leu Pro Val Asp 

210 " 215 220 * "~" ~ 
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TTG GCC GAG GAG CGG GGC CAC CGC OAC GTT GCA GGG TAC CTO CGC ACA 
Leu Ala Glu Glu Arg Gly His Arg Asp Val Ala Gly Tyr Leu Arg Thr 
225 230 235 240 



736 



GCC ACG GGG GAC TGA 
Ala Thr Gly Asp * 
245 



751 



(2) INFORMATION FOR SBQ ZD NO:16: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SSQDBKCB DESCRIPTION: SBQ ID NO: It; 

Met Arg Glu Glu Asn Lys Gly Met Pro Ser Gly Gly Gly Ser Asp Glu 
1 5 10 15 

Gly Leu Ala Ser Ala Ala Ala Arg Gly Leu Val Glu Lys Val Arg Gin 
20 25 30 

Leu Leu Glu Ala Gly Ala Asp Pro Asn Gly Val Asn Arg Phe Gly Arg 
35 40 45 

Arg Ala He Gin Val Met Met Met Gly Ser Ala Arg Val Ala Glu Leu 
50 55 80 

Leu Leu Leu His Gly Ala Glu Pro Asn Cys Ala Asp Pro Ala Thr Leu 
« 70 75 80 

Thr Arg Pro Val His Asp Ala Ala Arg Glu Gly Phe Leu Asp Thr Leu 
85 90 95 

Val Val Leu His Arg Ala Gly Ala Arg Leu Asp val Arg Asp Ala Trp 
100 105 110 

Gly Arg Leu Pro Val Asp Leu Ala Glu Glu Arg Gly Bis Arg Asp Val 
115 120 125 

Ala Gly Tyr Leu Arg Thr Ala Thr Gly Asp ♦ 
130 135 

(2) INFORMATION FOR SBQ ZD NO :17s 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 57 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE i NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM* Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SBQ ID N0tl7t 
AAAGOATCCA TTGCCACCAT GGAGCCGGCG OCOOGOAOCA GCATCGAGCC TTCQOCT 57 
(2) INFORMATION FOR SSQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 24 base pairs 

(B) TYPE i nucleic acid 
<C) STRANDBDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: SNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

(Vi) ORIGINAL SOURCE t 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: IS: 
TTTGAATTCA ATCGGGGATQ TCTO 24 
(2) INFORMATION FOR SSQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH j 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS t single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGGGAAGAOT GTCGTTAAG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 20 but pairs 

(B) TYPBi nucleic acid 

(C) STRANDEDNRSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE t 

(A) ORGANISM: Hone sapiens 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 
AOACTCCTCT ACAAATCTAC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 21s 
TGAGGGTCTG GCCAGC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS: single 

(D) J1OP0L0GY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE: 

(A) OROANXSM: Hono sapiens 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 x 
AGCACCACCA GCGTGTC 17 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) topology: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGTGTCCAGG AAGCCC 16 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 24: ' 
AATTCGGCAC GAGGCAGCAT GGAGCCTTCG GCTGACTGGC TSuCGACGGC CGCGGCCCGG ~ 6d»^ 
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GCTCGGGTAO AGGAGGTGCG GGCCCTGCTG GAGGCGGTGO CGCTGCCCAA CGCACCGAAT 120 
AOTTACG6TC GGAGGCCGAT CCAO X 44 
(2) INFORMATION FOR SEQ ID NO: 25; 

(i) SEQUENCE CHARACTERISTICS i 

(A) LBRGTHs 395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDMA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE t NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 25: 

AAOAGAGGGT TTTCXTGGTA AAGTTCGTGC GATCCCGGAG ACCCAOOACA GCGTAGCTGC 60 

GCTCTGGCTT TCGTGAACAT GTTGTTGAGG CTAOAOAGGA T CTTG AOAAfl AGGGCCOCAC 120 

CGGAATCCTG GACCAGGTOA TGATGATGGO CAACGTPCAC GTAGCAOCTC TTCIGCTCAA 180 

CTACGGTGCA GATTCOAACT OCGAGOACCC CACTACCTTC TCCCGCCCGG TOCACOACGC 240 
■ 

AGCGCGCGAA GGCTTCCTCffl ACACGCTGOT COTGCTOCAC GGGTCAGGG8 CTCGOCTOOA 300 

TGTCCGCGAT GCCTGGGGTC GCCTCCCGCT CQA C TT OG OC CAAOAOCOGO GACATCAAOA 360 

CATCGTGCGA TATTTGCGTT CCGCTOGGTG CTCTT 395 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS t single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: cDNA 
<iii» HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CAAOGCACCG AATACTTAOO 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base paira 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS: single 

(D) topology : linear 

(ii) MOLECULE TYPE i cDNA 
(iii) HYPOTHETICAL! NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 1 
TACTGAGGAG CCAGCOTCTA 
(2) INFORMATION FOR SEQ ID NO:2St 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDSDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID 130:26: 
TGAGTAGAAT TCTAACOGCC GTCATTGTTC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCS CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
AGCGTGTCCA GGAAGCCTTC 
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(2) INFORMATION FOR SEQ ID N0:30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 bas* Pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTIONS SEQ ID NOi30: 
TGAGTAOAAT TCTAACG6CC GTCAITO 
(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS : 

(A) length: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL t NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
ACTGCGA0GA CCCCACTACC TTCICC 
(2) INFORMATION FOR SEQ ID NOi32« 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GAACGTTGCC CATCATCATC 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE .CHARACTERISTICS: 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE i NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
AGTCTOCAGT TAAOO 

(2) INFORMATION FOR SEQ ID N0:34: " 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

<iij MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : "BS 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGCTAGAGGC OAATTATCTO T 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 35: 
CACCAAACAA AACAAOTGCC O 
(2) INFORMATION FOR SEQ ID NOi36: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 947 base pair a 

(B) TYPE: nucleic acid 

(C) STRANDEDNHSS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiene 

(ix) FEATURE: . 

(A) NAME/KEY: miac feature 

(B) LOCATION: 151 " 

(D) OTHER INFORMATION: /note* "Splice site acceptor." 

(ix) FEATURE: 

(A) NAME/KEY: miac_feature 

(B) LOCATION: 458 ~ 

(D) OTHER INFORMATION: /note. "Splice site acceptor." 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ATOOAOCCOO CGGCGGQOAO CAOCATGOAG CCTTCOGCTO ACTOOCTOOC CACOGOCOCO 60 

GCCCOGGOTC GGGTAOAGGA GOTOCGGOCO CTGCTSGAQO CGGTOOCOCT GCCCAACOCA 120 

CCGAATAGTT ACGGTCGOAO GCCGATCCAG GTCATQATOA TGGGCAGCOC CCGAGTGGCG 180 

GAGCTGCTGC TGCTCCACGG COOHlAeCCC AACTOCGCCG ACCCCOCCAC TCTCACCCGA 240 

CCCGTGCACO ACGCTGCCCO GGAGGGCTTC CTOOACACGC TGGTGOTGCT GCACCOGGCC 300 

GGGGCGCGGC TGGACGTGCG COATOCCTGO GOCCGTCTGC CCGTOOACCT GGCTGAGOAO 360 

CTOOOCCATC GCGAMTCQC ACOGTACCW COCOCOOCTO COOGOC10CAC CAOAOOCAOT ' 420 

AACCATOCCC GCATAGATGC COCGOAAOOT CCCTCAOACA TCCCCOATTO AAAOAACCAQ 480 

AGAGOCTCTO AGAAACCTCG OOAAACTTAO ATCATCAOTC ACCOAAGOTC CTACAGGOCC VaO 

ACAACTGCCC CCGCCACAAC CCACCCCQCT TTCGTAGTTT TCWMAOAA AATAGAGCTT ~600 : 



21 



WO 95/25813 PCT/US95AJ3537 

-112- 

TTAAAAATGT CCTGCCTTTT AACCTAGATA 'TAAGCCTTCC CCCACTACCG TAAATGTCCA 660 

TTTATATCAT TTTTTATATA TTCTTATAAA AATCTAAAAA AQAAAAACAC CGCTTCTGCC 720 

TTTTCACTOT OTTOQAOTTT TCTGOAflTGA 6CACTCACGC CCTAAOCGCA CATTCATCTG 780 

GOCATTTCTT OCOACCCTCG CAOCCTCCGQ AAQCTC TCQA CTTCATOACA AOCATTTTGT 840 

QAACTAOQGA AOCTCAGGOQ GGTTACTGGC TTClCROAfl TCACACTOCT ACCAAATCCC 900 

AGAACCAAAO CTCAAATAAA AATAAAATTA TTTTCATTCA TTCACTC 947 
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WHAT IS CT.ATMFT> TR« 

1. A method for diagnosing a polymorphism associated with predisposition for cancer 
including melanoma, leukemia, astrocytoma, glioblastoma, lymphoma, glioma, Hodgkm's 
lymphoma, CLL, and cancers of the pancreas, breast, thyroid, ovary, uterus, testis, kidney, 
stomach and rectum, in a mammal comprising: detecting a gexmline attention of the wild- 
type MTS gene or its expression products in a mammal sample, said alteration indicating a 
predisposition to said cancer. 

2. The method of claim 1 wherein said MTS gene is MTS 1. 

3. The method of claim 1 wherein said MTS gene is MTS2. 

4. The method of claim 1 wherein said MTS gene is MTSlElp. 

5. The method of claim 1 wherein said expression product is mRNAofMTSl. 

6. The method of claim 1 wherein said expression product is mRNA of MTS2. 

7. The method of claim 1 wherein said expression product is mRNA of MTSlElp. 

8. The method of claim 5 wherein the alteration of the wild-type MTS gene mRNA is detected 
by hybridization of the mRNA from said tissue sample 

9. The method of claim 6 wherein the alteration of the wild-type MTS gene mRNA is detected 
by hybridization of the mRNA from said tissue sample to an MTS gene probe. 

10. The method of claim 1 wherein an alteration is detected in the regulatory regions of the MTS 
gene. 
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11. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
observing shifts in electrophoretic mobility of single-stranded DNA on non-denaturing 
polyacrylamidc gels. 

12. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
hybridization of an MTS gene probe to genomic DNA isolated from said tissue. 

13. The method of claim 12 wherein the MTS gene probe hybridizes to an exon selected from 
the group consisting of nucleotides 891 to 1016 ofSEQ ID N03 and nucleotides 192 to 498 
ofSEQIDNO*. 

.14. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
hybridization with an allde-specific probe. 

IS. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
amplifying all or part of the MTS gene in said tissue to produce an amplified sequence and 
sequencing the am plified se q ue n ce 

•16. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
amplifying all or part of the MTS gene for a specific MTS allele, 

17. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
moleculariy cloning all or part of the MTS gene in said tissue to produce a cloned sequence 
ami sequencing the cloned sequence. 

18. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
identifying a mismatch between molecules (1) MTS gene genomic DNA or MTS mRNA 
isolated from said tissue, and (2) a nucleic acid probe complementary to the human wild-type 
MTS gene DNA, when molecules (1) and (2) are hybridized to each other to form a duplex. 
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19. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
amplification of MTS gene sequences in said tissue and hybridization of the amplified 
sequences to nucleic acid probes which comprise wild-type MTS gene sequences. 

20. The method of claim 1 wherein the alteration of die wild-type MTS gene is detected by 
amplification of MTS gene sequences in said tissue and hybridization of the amplified 
sequences to nucleic acid probes which comprise non-wild-type MTS gene sequences. 

21. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
screening for a deletion mutation. 

22. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
screening for a point mutation. 

23. The method of claim 1 wherein the alteration of the wild-type MTS gene is detected by 
screening for an insertion. 

24. The method of claim 1 wherein die alteration of the wild-type MTS gene is detected by in 
situ hybridization ^f the MTS gene with nucleic acid probes which comprise the MTS gene. 

25. He method of claim 1 wherein the expression products are protein molecules. 

26. The method of claim 25 wherein the alteration 
immunoblotdng. 

27. The method of claim 25 wherein the alteration of wild-type MTS gene protein is detected by 
immunocytochemistry. 



28. 



The method of claim 25 wherein the alteration of wild-type MTS gene protein is detected by 
assaying for binding interactions between MTS gene protein isolated from said^sye and a 
Cdk. „ 
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29. The method of claim 28 wherein said Cdk is Cdk4. 

30. The method of claim 25 wherein the alteration of wild- type MTS gene protein is delected by 
assaying for the inhibition of biochemical activity of a Cdk. 

31. The method of claim 30 wherein said Cdk is Cdk4. 

32. A method for confirming the lack of a predisposition for cancer at the MTS locus in a 
mammal, comprising: detecting the wild-type MTS gene or hs expression products in a 
mammal sample, the presence of the wild-type gene or expression products indicating no 
predisposition to cancer at the MTS locus. 

33. The method of claim 32 wherein said MTS gene is MTS 1 . 

34. The method of claim 32 wherein said MTS gene is MTS2. 

35. The method of claim 32 wherein said expression product is mRNA of MTS 1. 

36. The method of claim 32 wherein said expression product is mRNAofMTSl 

37. The method of claim 35 wherein the wild-type MTS gene mRNA is detected by 
hybridization of the mRNA from said tissue sample to an MTS gene probe. 

38. The method of claim 36 wherein the wild-type MTS gene mRNA is detected by 
hybri dizati on of the mRNA from said tissue sample to an MTS gene probe. 

39. The method of claim 32 wherein the wild-type MTS gene is detected by hybridization of an 
MTS gene probe to genomic DNA isolated from said tissue. 
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40. The method of claim 39 wherein the MTS gene probe hybridizes to an exon selected from 
the group consisting of nucleotides 891 to 1016 of SEQ ID NO:3 and nucleotides 192 to 498 
ofSEQIDNO:4. 

5 41. The method of claim 32 wherein the wild-type MTS gene is detected by amplifying all or 
part of the MTS gene in said tissue to produce an amplified sequence and sequencing the 
amplified sequence, 

42. The method of claim 32 wherein the wild-type MTS gene is detected by molecularly cloning 
10 all or part of the MTS gene in said tissue to produce a cloned sequence and se q uencing the 

cloned sequence. 

43. The method of claim 32 wherein the wild-type MTS gene is detected by hi situ hybridization 
of the MTS gene with nucleic arid probes which comprise die MTS gene. 

15 

44. The method of claim 32 wherein the expression products are protein molecules. 

45. The method of claim 44 wherein the alteration of wild-type MTS gene protein is detected by 
immunoblotting. 

20 

46. The method of claim 44 wherein the alteration of wild-type MTS gene protein is detected by 

imrnimn cyftKhrnu** , y 

47. The method of claim 44 wherein the alteration of wild-type MTS gene protein is detected by 
25 assaying for binding interactions between MTS gene protein isolated from said tissue and a 

CdL 

48. The method of claim 47 wherein said Cdk is Cdk4. 

30 49. The method of claim 44 wherein the alteration of wild-type MTS gene protein a detected by 
assaying -for the inhibition of biochemical activity of a Cdk. _ ^ 
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50. The method ofclaim 49 wherein said Cdk is Cdk4. 

51. An isolated DNA consisting of SEQ ID NO:l having A at nucleotide 353. 

52. An isolated DNA consisting of SEQ ID NO:l having T at nucleotide 277. 

53. A nucleic acid probe complementary to human altered MTS gene sequences which 
hybridizes to part of the MTS gene, said part of the MTS gene consisting of SEQ ID NO:l 
having A at nucleotide 353. 

54. A nucleic acid probe complementary to human altered MTS gene sequences which 
hybridizes to part of the MTS gene, said part of the MTS gene consisting of SEQ ID NO:l 
having T at nucleotide 277. 

55. A replicative cloning vector which comprises the isolated DNA of claim 51 or 52 and a 
replicon operative in a host celL 

56. An expression system which comprises the isolated DNA ofclaim 51 or 52 operably linked 
to suitable control sequences. 

57. Reaanbinant host cells transformed with the replicative dating vector ofclaim 55. 

58. Reconibinant host cells transformed with the expression system ofclaim 56. 



59. 



A method of producing recombinant MTS polypeptide which comprises cutaning 
c laim 58 under conditions effective for the production of said MTS polypeptide. 
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KTS1 0 TCCCCCGcCCfflVTTAAwTAAACCtCATCTTTCCAGAGTcTCT^ 

j 1- |J ^ .J" - 

„tSl 60 TaCACGTcTCAGAAAC^^ 

if \l « " id I" 

MTSl 120 GGG^GGAGAGAGCAGTCCTTTTcTCC^^ 

ill ill 141 isi"" i?1 • 

HTS1 180 AGTGAACCCCGCG^CCTGAAAATCAvG^TTGA^ 

.__ 1-^ I™ * i u " i 2 l 231 



181 



HTS1 240 GyACAGsTkATTTCGaTyCTCGCreCGGCCT 

241 1*1 261 ill '~ "I 291 



TwCTTTTGCTTT 



MTSl 300 TTCTTATGATTAAAAGAAGAAGCCATACTTTtCCCTATCACACCA 

ill ill III "1 341 351 

MTSl 360 TTTGGCAGTTAGGAAGGTTGTATCGCGGAGGAAGGA^ 

ill' ill ""III 391 401 ill 

MTSl 420 TTtAACAGAGTCAACCCACTCAAACAC^ 

ill isi" 441 451 461 471 

MTSl 480 CACCAGGCGAGCCCCWAGGGCGGTGTGGGGGGCACGTCCCGAGG^ 

\7l i« 501 sll si'!' 531 

MTSl 540 CTTGC^CGCTGGCTCTGGCGAGGGCTG^CGG 

... !~ ! gl 571 581 391 

MTSl 600 CCTGGGGCGACTTCAGGGCTCC^CATTCG^ 

ill 111 " 621 "I " l « Sl 

MTSl 660 CGAGCACTCGCTCACAGCGTCCCCTTGCCTGGAAAGATACCG 

ill ill "I 691 701 711 

FIG. 5A 
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MTS1 720 TGAGGGACAGGGTCGGAGGGGGCTCTTCCGCCAGCACCGGAGGAAGAAAGWGAGGGGCT 

721 731 741 751 761 771 

mSl 780 GGCTGGTCACCAGAGGGTGGGGCGGACCGCGTGCGCICGGCGGCTGCG6A 

791 801 811 821 831 

HTS1 840 GCAGGCAGCGGGCGGCGGGGAGCAGCATGGAGCCGGCGGCGGGGAGCAGCATGGAGCCTT 
pl6 0 AATTOGGCACGAGCCAGCaifiGAGCCTT 

841 851 86i 871 881 891 

KTS1 900 CGGCTGACTCCCTGGCCACCGCCGCGGCCCGGGCTCCGCTAGAGGAGGTGCGGGCGCTGC 
pl6 28 CGGCTGACTGGCTCGCCACGGCCGCGGCCCGGGGTCGGGTACACGAGGTCCCGGCGCTGC 

901 911 921 931 941 951 . 

KTS1 960 TGGAGGCCGTGGCGCTGCCCAACGCACCCAATAGTTACGGTCGGAGGCCCATCCAGGTGG 
pl6 88 TGGAGG0GGTGGCGCTGCCCAAC6CACCGAATAGTTACGGTCCGAGGCCCATCCAG 

961 971 961 991 1001 1011 

MTS1 1020 CTAGAGGGTCTGCAGCGGGAGCAGGGGATGGCGGGCGA 

1021 1031 1041 1051 1061 1071 

KTS 11080 CGGAATTCGAATCAGGTAGCGCTTCCATTCTCCGCAAAAAGGGCACGCTTCCTGGGGAGT 

1081 1091 1101 1111 1121 1131 

KTS1U40 




FIG. 5B 
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MTS1 0 GAATTCATTGtGTACTGAAgAATGGaTAGAGAACTCAAGAAGGAAaTTGGaAACTaGAAG 

j — — — ,- — ,. — t — 

KT51 60 CAAATGTAGGGGTAATTAGACACCTGGGGCTTGTGTGGGGCTC 

61 71 61 91 101 111 



1S1 



I 

171 



MTS1 120 CcTCTACACAAGCCTCCTTTCCCTCATGCCCGCCCCCACCCTGGCTCTG 
121 131 141 

MTS1 180 CTCTCTGGCAG^TCATGATGATGGGCAGCGCCCGAGTGGCGGAGCTGCTGCTGCTOCAM 
pl6 0 ^ GTCATGATGATGGGCAGCGCCCGAGTGGCGGAGCTGCTGCTGCTOCACG 



181 



191 



201 



J — 



211 



221 



231 



MTS1 240 GCGCGGAGCCCAACTGCGCCGACCCCGCCACTCTCACCCGACCCGTGCACGACGCTOCCC 
pl6 49 GCCCCGACCCCAACTCCGCCCACCCajCCACTCrCACCC^^ 

241 251 261 271 281 291 

KTS1 300 GGGAGGGcTTCCTGGACACGCTGGTGGTGCTGCACCGGGCCGGGGCGCGGCTGGACGTGC 
pl6 109 GGGAGGGCTTCCTGGACACGCTGGTGGTGCTGCACCGGGCCGGGGCGCGGCTGGACGTGC 

301 311 321 331 341 351 



KTS1 360 GCGATGCCTGGGGCCGTCTGCCCGTGGACCTGGCTGAGGAGCTGGGCCATCGCGATGTCG 
pi 6 169 GCCATGCCTGGGGCCGTCTGCCCGTGCACCTCGCTCAGGAGCTGGGCCATCGCGATGTCG 

; j ir~ 



361 



I 

371 



381 



391 



401 



411 



KTS1 4 20 
pl6 229 



HTS1480 
pl6 289 

I 

481 



CACGGTACCTGCGCGCGGCTCCGCCGGGCACCAGAGGCAGTAACCATCCCOGCATAGATG 
CACG GT ACCTG CG CGCGG CTG CG GGGGGCACCAG AGG CAGTAACCATGCCCC CATAGATC 



421 



431 



441 



451 



461 



471 



CCGCGGAAGGTCCCTCAGGTGAGGACTGATGATCTGAGAATlTGTACyCTGAGAGCTTCC 
CCGCCGAAGGTCCCTCAG 

1 

511 



i 

491 



t 

501 



i - 
521 



I 

531 



MTS1 540 AAAGCTCAGAGCATTCATTTTCCAGCACAGAAAGTTCAGCCCGGGAGACCAGTCTCCGGT 
J _ 1 . m i . i I j « 

541 551 561 571 581 591 



FIG. 6A 
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HTSl 600 CTTGCGCTCAeCTCACGMCCAAT^ 

soT ill «1 "1 641 « 51 

MTS1 660 CsCGCACAGCGCCGCAAATGCGAAATAATCCCGAAATGGACT^ 

' -- i-" lal 691 



661 671 



MTS1 720 TCTTGTACGTTATACTTCCCAAACCATACCACCACCCAAAttCCTACCCrCT6CTA6TTC 

i | j — --- — } — { — ■■ ~" i --— - 

4 2 1 731 741 751 761 771 

HTSl 780 AAGGOCTAGACTCC«»GCAATCAAGA^ 

I— ~±:: "ioi""" ill ~ 821 " 831 



781 791 



HTSl 840 CTCCAAACTAGCCCCAGTTCCATCsACTTACCA^ 

841 85l" 861"" 871 881 891 

HTSl 900 CTGGCTGGACCAACCTCaGGATTTCCAAACCCAATTGTC^ 

j }— ]-- j--- "»~ ^ S<1 



901 911 



931 931 941 9S1 



HTSl 960 CTCGATCTCtWCTCTTCTGCACAACTCAACTAATCTGACCCT 

III 971 9I1 iii iooi ion 

KTS1 1020 TCCGCTITATCCGGTAGAGlTTTCCAGAgCT^ 

1021 lMl" 1041 1031 1061 1071 

HTS11080 CAAGACTTCCCTCACCCTGgCAGTCTGTCCAC^ 

1081 1091 U01 llll" " 11" 1131 



HTSl 1140 



TCTgCAAGAGATCCCTATAGTGXCTCGTATTATnCGGCWCGAATTC^ 

iul 1151"" U61 1171 "81 

FIG. 6B 
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MTS2 0 CATCATCACTTTACCATCAACTTTCTrCTCTCTGaACGTTTACAC^TAAAATCCCMTT 

I i: ii 1; <i » 

HTS2 «0 W^gGTvCTGAGTvTAACCTCAaGGTCGCG^ 

|. I" « x {ox """ in 

MTS2 120 GAAACJiCCAGAeATCAGAGACCTGAACACCTCtGCACTGGjCT 

[H in "1 151 161 "1 

KTS2 1B0 TGTTTCTTTAAaTGGcTCCACCK;CC^ 

1m in"" 201 211 221 "I 

HTS2 240 CCC^CTCTGACCACTCTGCTCTCXCTG 

ill isi iel 271 281 291 

MTS2 300 CCGGAGCTGCTCCTGCTCcACGGCGCGGAGCCCAACTGCGCAGACCCTCC^CTCTCACC 
™ll 27 GCCGACCTCCrWCTG^CACGGCGCGGAG 

III ill " ill ' 111 341 351 

Tie 3 a%° SSSSSSS^^^^^^^^ 

ill 371 " "III 391 401 411 

MTS2 420 CCCGGCGCCCCCCTCCACCTGCGCGATGCCTGGGCTCGTCTGCCCGTCGACTTGGCC6A 
p" iJ? GC?CCCGCGCGCCT<^GTCCGCGATGCC^ 

\H \7l Ul 4U " "461 " ^471 

"K io? SSS=SS 

in i91 in ill 521 531 

MTS2 540 GTTCCXCACCCCCCCACAACGACTmCTTT^ 
pl6 267 AGTAACCATGCCCGCATAGATCCCCCGGAAGCTCCCTCAG. , 

• ( } > lei si 



541 SSI 561 571 581 Sfl 

FIG. 7A 
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<00 AATTCGATC^GGCTGCCAA^AWGGCCW^^TMGCCT^C 

■n It: Li ' "«i «i 651 



MTS2 660 GAGAC^CAGGAAGGACGXGCCGACCCGG^T^OTCCAWCA^ 
KTS2 720 TW«CICGCttWl«^ 

In 15" i« "*» " l 771 

„TS2 780 mAGGAGTGTGTGACATTCCAGGGGTCGlTrGnTTT^ 

j_ ;-~ l oi - ial B3i 

KTS2 840 CATATGAAATCCTTCACTGGACCTGGTGGCTA^GAATCTTTC 

i 4 1 iS 861 " 'hi iai 891 

MTS2 900 CTCCACCCCTCAGTGGGAGAAGGCA^^ 

ilT III i« & " *41 

MTS2 960 ttOCttUWCI^ 



961 971 



981 991 1001 1011 



HT821020 CGAAAACCTGAAGACrGAGGACA^GT^ 

UST i«I i04l" " i05l" i061 X071 

MTS21080 CnirtXWCCCTHWSTTXTO^ 

iJiT io^T Iioi" "ilu 1131 

KTS21140 CCCW*AGAC^C«^C^^ 

lm iisi ii6i ii7i iiei 1191 

MTS2 1200 GAOSACAGATCCTAAAATCC^^T^CGACGCTC^CAGCAC 

[ill liu 122l"" 1231 1241 

FIG.7B 
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mt<81 0 TGTGTGGGCWTCTGCTTGGCOflTGAGGGGGCTCTACACAAGCTTCCTTTCCGTCATGCCC 
MTS2 0 AATTACCTGTTTCTTTAAATCOCTCCACCTGCCTTGCCCCOGCCCGCATCTCCCATACCT 

- v , 

MTS1 60 GCCCCttCCCT^TCTGACCATTCTGTTCTCTCTGCCAGCTCATOATOATGGCCAOCOC 
HTS2 60 GCCCCCACCCTO6CTCT«C«CTCTOCTCXCTCTG6O0GTCM^ 

I I • 1 1 1 

MTS1 120 CCGAGT<WCGGAGC^CTGCTCCTCCACGGCGCC»AOCCCAACT«CGCCOACCCCOCCAC 
MTS2120 CCGCGTGGCGGAGCTGCTGCTCCTCCACGGCGCGGAGCCCAACTGCGCAGACCCTGCCAC . 

I I" • I " 1 1 

MTS1 180 TCTCACCCGACCCGTGCACGACGCTGCCCGGGAGCGCTTCCTGGACACOCTCGTOGTOCT 
MTS2 180 TCTCACCCGACCGGTGCATGATGCTGCCCGGGAGGGCTTCCTGGACACGCTGGTGGTGCT 

I I « ' 1 1 1 

MTS1240 GCACCCCGCCGGGGCGCGGCTGGACOTGCOCGAMCCT0GGOCCOTCTGCCCGTGGACCT 
MTS2 240 GCACCGGGCCGGGGCGCGGCTGGACGTGCGCGATGCCTGGGGTCGTCTGCCCGTGGACTT 



MTS1 300 GGCTGAGGAGCTGGGCCATCGCGATGTCGCACGGTACCIGCGCCCGGCTGCGGGGGGCAC 
MTS2 300 GGCCGAGGAGCGGCGCCACCGCGACGTTGCAGGGTACCTGCGCACAGCCACGGOGOACTG 

, , i-—. 

I I- I 1 1 '"" 

MTSl 420 ATCTGAGAATTTGTACYCTGAGAGCTTCCAAAGCTCA 
MTS2 420 CCCACCTAATTCGATGAAGGCTGCCAACGGGGAGCGG 

I I- I 1 

FIG. 8 
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CGGCCAGTGA GGACTCCGCG ACGCGTCCGC ACCCTGCGGC CAGAGCGGCT TT6AGCTCGG 60 

CTGCGTCCGC . GCTAGGCGCT TTTTCCCACA AGCAATCCAG GCGCGCCCCC TGGTTCTTGA 120 

GCGCCAGGAA AAGCCCGGAG CTAACGACCG GCCGCTCGGC CACTGCACGG GGCCCCAAGC 180 

CGCAGAAGGA CGACGOOAGO GTAATQAAGC TGAQCCCAOG TCTCCTAGOA AGGAGAGAOT 240 

GCGCCG6AGC AGCGTOGGAA AGAAGGGAAO AGTGTCGTTA AGTTTACGGC CAACGGTGGA 300 

TTATCCGGGC CGCTGCGCGT CTGGGGGCTG CGGA ATG CGC GAG GAG AAC AAG 352 

Met Arg Glu Glu Asn Lys 
1 5 

GGC ATG CCC ACT GGG GGC GGC AGC GAT GAG GOT CTG GCC AGC GCC OCG 400 
Gly Met Pro Ser Gly Gly Gly Ser Asp Glu Gly Leu Ala Ser Ala Ala 
10 « 20 

OCG CGG GGA CTA GTG GAG AAG GTG CGA CAO CTC CTG GAA GCC GGC GOG 448 
Ala Arg Gly Leu Val Glu Lys Val Arg Gin Leu Leu Glu Ala Gly Ala 

25 3° 35 J 

GAT CCC AAC GGA GTC AAC CGT TTC CGG AGO CGC GCG ATC CAG GTC ATG 496 
Asp Pro Asn Gly val Asn Arg Phe Gly Arg Arg Ala He Gin Val Met 
40 45 50 

ATG ATG GGC AGC GCC CGC GTG GCG GAG CTG CTG CTG CTC CAC GGC GCG 544 
Met Met Gly Ser Ala Arg Val Ala Glu Leu Leu Leu Leu His Gly Ala 
55 60 65 ™ 

GAG CCC AAC TGC GCA GAC CCT GCC ,.CT CTC ACC CGA CCG GTG CAT GAT 592 
Glu Pro Asn Cys Ala Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp 
75 80 85 

GCT GCC CGG GAG GGC TTC CTG GAC ACQ CTG GTG GTG CTG CAC CGG GCC 640 
Ala Ala Arg Glu Gly Phe Leu Asp Thr Leu Val Val Leu His Arg Ala 
90 95 100 

GGG GCG CGG CTG GAC GTG CGC GAT GCC TGG GOT COT CTG CCC GTG CAC 688 
Gly Ala Arg Leu Asp Val Arg Asp Ala Trp Oly Arg Leu Pro Val Asp 
105 U0 US 

TTO GCC GAG GAG CGG GGC CAC CGC GAC GTT GCA GGG TAC CTG CGC ACA 736 
Leu Ala Glu Glu Arg Gly His Arg Asp Val Ala Gly Tyr Leu Arg Thr 
120 125 130 



GCC ACG GGG GAC TGA 
Ala Thr Gly Asp 

155 - FIG. 1 1 
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CGCGCCTGCG GGGCGGAGAT GGGCAGGGGG CGGTGCGTGG GTCCCAGTCT GCAGTTAAGG 60 
GGGCAGGAGT GGCGCTGCTC ACCTCTGGTG CCAAAGGGCG GCGCAGCGGC TGCCGAGCTC 120 
GGCCCTGGAG GCGGCGAGAA CATGGTGCGC AGGTTCATGO TGACCCTCCG GATTCGGCGC 180 
GCGTGCGGAC CGCCGCGAGT GAGGGTTTTC GTGGTTCACA TCCCGCGGCT CACGGGGGAG 240 
•TGGGCAGCAC CAGGGGCGCC CGCCGCTGTG GCCCTCGTGC TGATGCTACT GAGGAGCCAG 300 
CGTCTAGGGC AGCAGCCGCT TCCTAGAAGA CCA<Lc ATG ATG ATG GCC ACC CCC 3SS 

1 5 

CGA GTG GCG GAG CTG CTG CTG CTC CAC GGC GCG GAG CCC AAC TGC GCC 403 
Aro val Ala Glu Leu Leu Leu Leu His Gly Ala Glu Pro Asn Cys Ala 
10 IS 20 

GAC CCC GCC ACT CTC ACC CGA CCC GTG CAC CAC GCT GCC C6G GAG GGC 451 
Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp Ala Ala Arg Glu Gly 
25 30 35 

TTC CTG GAC ACQ CTG GTG GTG CTG CAC CGO GCC GGG GCG CGG CTG GAC 499 
Phe Leu Asp Thr Leu Val. Val Leu His Arg Ala Gly Ala Arg Leu Asp 
40 «5 50 

GTG CGC GAT GCC TOG GGC CGT CTG CCC GTG GAC CTG GCT GAG GAG CTG 
val Arg Asp Ala Trp Gly Arg Leu Pro Val Asp Leu Ala Glu Glu Leu 
55 60 65 ™ 
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GGC CAT CGC GAT GTC GCA CGG TAG CTG CGC GCG OCT GCG GGG GGC ACC 595 
Gly His Arg Asp Val Ala Arg Tyr Leu Arg Ala Ala Ala Gly Gly Thr 
75 80 85 

♦ 

AGA GGC ACT AAC CAT GCC CGC ATA GAT GCC GCG GAA GOT CCC TCA GAC 643 
Arg Gly Ser Asn His Ala Arg lie Asp Ala Ala Glu Gly Pro ser Asp 
90 95 100 



ATC CCC GAT TGAAAGAACC AGAGAGGCTC TGAGAAACCT OGGGAAAGTT 692 
lie Pro Asp 
105 



AGATCATCAG 


TCACCGAAGG 


TCCTACAGGG 


CCACAACTGC CCCCGCCACA ACCCACCCCG 752 


CTTTCGTAGT 


TTTCATTTAG 


AAAATAGAGC 


TTTTAAAAAT GTCCTGCCTT TTAACGTAGA 812 


TATAAGCCTT 


CCCCCACTAC 


CGTAAATSTC 


CATTTATATC ATTTTTTATA TATTCTTATA 872 


AAAATGTAAA 


AAAGAAAAAC 


ACCGCTTCTG 


CCTTTTCACT GTGTTGGAGT TTTCTGGAGT 932 


GAGCACTCAC 


GCCCTAAGCG 


CACATTCATG 


TGGGCATTTC TTGCGAGCCT CGCAGCCTCC 992 


GGAAGCTGTC 


GACTTCATGA 


CAAGCATTTT 


GTGAACTAGG GAAGCTCAGG GGGGTTACTG 1052 


GCTTCTCTTG 


AGTCACACTG 


CTAGCAAATG 


GCAGAACCAA AGCTCAAATA AAAATAAAAT 1112 


TATTTTCATT 


CATTCACTC 
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mouse* aaga6A666TTTTCtT66TaaAgtTCgt6C6atcCcg6a6accca6 
human? ccgcgagtGAG66nnCgTMTtcAcaTCccGC6ga 

1 10 20 30 40 50 

mouse * 
human* 

51 60 70" ""80 90 y 100 

mouse* AGGAtCt1^GaagAGG6CcGCAcCgGaaTCCTg6A---CCAGGTg^ 
human? AGGAgCcaTEGtc!^^ 

101 110 120 130 140~~ 150 

mouse* TGATGGGCAaCGttCacGTaGCaGctCTtCTGgcaaCtACG6tGCaGAt 
human* TGATGMCAgCGccCgaGTgGCgGagCTgCTGCTgctCcACM 

151 160 170 180 "l90 200 

mouse* tCgAACTGCGagGACCCCaCtACctTCtCCCGcCCgGTGCACGACGC^ 
human* cCcAAaGCGccGACCCCgCcACtcTCaCCCGaCCcGTGCACGACGC^ 

201 210 220 230 256 250 

est mmnagmgmgs 

253 260 270* 280 290 300 

mouse * GGCTGGA tGTcCGCGATGCCTGGGGtCGcCTcCCgcTcGACTTcGCCcAa 
human* GGCTGGAcGTgCGCGATGCCTGGGGcCGtCTgCCcgTgGA^ 

301 310 325 330 340* 350 

mouse* GAGCgGGGaCATCaaGAcaTCGtflCGaTAttT6CGttCc6CTGgGtGctC 
human* GAGCtGGGcCATCgcGAtgTCGcaC6gTAccTGCGcgCg6CT^ 

351 360 370 380 390* 100 

human? CACCAGAGGCAGTAACCATGCCCGCATAGATGCCGCGG^ 

... ,-- «~ ^30 440 450 
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0 ATGGAGCCGGCGGC6GGGAGCAGCAT6GAGCCTTCGGCTGACTGGCTGGCCACGGCCGCG 

6 ii "~~26 "36 46 — 

60 GCCCGGGGTCGGGTAGAGGAGGTGCGGGCGCTGCTGGAGGCGGTGGCGCTGCCCAACGCA 

66" — '"""gg"* ""95"" """"lor" ~116~ 
120 CCGAATAGTTACG6TCGGAGGCCGATCCAGGTCATGATGATGGGCAGC6CCCGAGTGGCG 

126 136 146 156 166 176 
180 GAGCTGCTGCTGCTCCACGGC6C6GA6CCCAACTGCGCCGACCCCGCCACTCTCACCCGA 

186 IS ~206 "216 226 236 
240 CCCGTGGACGACGCTGCCCGGGAGGGCTTCCTGGACACGCTG6TGGTGCT6CACCGGGCC 

246 256 266 276 *286 296 
300 GG6GCGCGGCTGGACGT6CGCGATGCCTGGGGCCGTCT6CCC6TGGACCTGGCTGAGGAG 

306 316 ^326 336 346*" 356* 
360 CTGGGCCATCGCGATGTCGCAC6GTACCTGCGCGCGGCTGC6GGGGGCACCAGAGGCAGT 

366 376 386 396 ~ i5i~ 416 
420 AACCATGCCC6CATAGAT6CCGCGGAAGGTCCCTCAG~ACATCCCCGATTGAAA6AACC 

426~ 436 446 456 466 476 
478 AGAGAGGCTCTGAGAAACCTCG6GAAACTTAGATCATCAGTCACCGAAGGTCCTACAGGG 

486 """" _ 496 "' Sol' 516 526 536 " 
538 CCACAACTGCCCCCGCCACAACC 

546 556 566 576 586 ""596 
598 TTnAAAAATGTCCTGCCTTTTAACGTAGATATAAGCCTTCCCCCACTACC6TAAAT6TC 

606 616 626 636 646 656 
658 CATTTATATCATTTTnATATATTmATAAAAAT6TAAAAAA6AAAAACA^ 

666" ""676"* ""686"" "695 706 716 

718 9SZZZ3E-EZ^Z-ZT-— — — TTZT^^^*^^*?^^^*^^*^^^^* agcgcacattcatg 

726 " 736 """"""746 756 766 776 
778 TG6GCATncnGCGAGCCTC6CA6CCTCC6GAAGCT6TC6ACnCATGACAAGCATTTT 

786" "796"* "iol" """816 826 836 
838 GTGAACTAGGGAAGCTCAGGGGGpACTGGCn 

846 * "856" "866" "876"" ""886 896 



898 GCAGAACCAAAGCTCAAATAAAAATAAAATTATTTTCATTCATTCACTC 
906 916 926 "936 946 "" 
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